Quantum mechanics from classical statistics 
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Quantum mechanics can emerge from classical statistics. A typical quantum system describes an 
isolated subsystem of a classical statistical ensemble with infinitely many classical states. The state 
of this subsystem can be characterized by only a few probabilistic observables. Their expectation 
values define a density matrix if they obey a "purity constraint" . Then all the usual laws of quantum 
mechanics follow, including Heisenberg's uncertainty relation, entanglement and a violation of Bell's 
inequalities. No concepts beyond classical statistics are needed for quantum physics - the differences 
are only apparent and result from the particularities of those classical statistical systems which 
admit a quantum mechanical description. Born's rule for quantum mechanical probabilities follows 
from the probability concept for a classical statistical ensemble. In particular, we show how the non- 
commuting properties of quantum operators are associated to the use of conditional probabilities 
within the classical system, and how a unitary time evolution reflects the isolation of the subsystem. 
As an illustration, we discuss a classical statistical implementation of a quantum computer. 
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I. INTRODUCTION 

A realization of quantum mechanics as a classical statis- 
tical system may shed new light on the conceptual inter- 
pretation of experiments based on entanglement, as tele- 
portation or quantum cryptography [1]. One may even 
speculate that steps in quantum computing could be 
realized by exploiting classical statistics. Recently, classi- 
cal statistical ensembles that are equivalent to four-state 
and two-state quantum mechanics have been constructed 
explicitly @,||. This constitutes a proof of equivalence of 
few-state quantum statistics and particular classical statis- 
tical ensembles with infinitely many degrees of freedom. In 
view of the particular manifolds of classical states for these 
examples one may wonder if quantum statistical systems 
are very special cases of classical statistics, or if they arise 
rather genuinely under certain conditions. In this paper 
we argue that quantum statistics can indeed emerge rather 
generically if one describes small "isolated" subsystems of 
classical ensembles with an infinity of states. 

An atom in quantum mechanics is an isolated system 
with a few degrees of freedom. This contrasts with quan- 
tum field theory, where an atom is described as a particular 
excitation of the vacuum. The vacuum in quantum field 
theory is a complicated object, involving infinitely many 
degrees of freedom. In a fundamental theory of particle 
physics, which underlies the description of atoms, collec- 
tive effects, such as spontaneous symmetry breaking, are 
crucial for its understanding. Our treatment of atoms in 
the context of classical statistics is similar to the conceptual 
setting of quantum field theory. A classical statistical sys- 
tem with infinitely many states describes the atom and its 
environment or the atom and the vacuum. The quantum 
statistical features become apparent if one concentrates on 
a subsystem that describes the isolated atom. 

Only a small part of the information contained in the 
probability distribution for the classical statistical ensem- 
ble is used for the description of the properties of the sub- 
system. Statistical subsystems are also relevant if the de- 
scription of the atom does not use all the possible "micro- 



scopic information" which could, in principle, be available 
on length scales many orders of magnitude smaller than 
the size of the atom. The subsystem corresponds then to a 
"coarse grained approach" , for example ignoring the con- 
stituents of the atomic nucleus. Furthermore, the prob- 
ability distribution at a given time can be interpreted as 
a subsystem of a distribution for probabilities of events 
at different times. The part of the information which is 
contained in the probability distribution for the classical 
statistical ensemble but not used for the subsystem will 
generically be called "environment" . The detailed proper- 
ties and physical meaning of the environment will not be 
important for the emergence of quantum structures. 

Our classical statistical description of quantum systems 
has four crucial ingredients. (1) Probabilistic observables 
have in a given quantum state only a probabilistic distri- 
bution of possible measurement values, rather than a fixed 
value as for classical observables in a given state of the 
classical ensemble. Probabilistic observables obtain from 
classical observables by "integrating out" the environment 
degrees of freedom. The fact that this map is not invertible 
avoids conflicts with the Kochen-Specker theorem [f| |(|. 
The probabilistic nature of the observables can be under- 
stood as a result of "coarse graining of the information", 
starting from classical observables on a suitable level. Al- 
ternatively, this concept may be used as a basic definition 
of observables @],[H]. 

(2) Incomplete statistics characterizes the subsystem. 
This means that the joint probability for finding a mea- 
surement value a for the "system observable" A, and b 
for a second system observable B, cannot be computed 
from the state of the subsystem alone for all pairs (A,B). 
Typically, joint probabilities require information about the 
precise state of the environment. 

(3) Conditional probabilities are used for a computation 
of the probabilities for the possible outcomes of two mea- 
surements of observables A and B. In particular, if B is 
measured after A, the outcome of the measurement of B 
depends on the previous measurement of A. The classical 
correlations (A ■ B) are not uniquely defined for the quan- 
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turn system - they depend on detailed properties of the 
environment. We argue that the appropriate conditional 
correlations for measurements of properties of the isolated 
subsystem should only require information which charac- 
terizes the subsystem. With this requirement the condi- 
tional probabilities induce the concept of quantum corre- 
lations, and we propose that quantum correlations rather 
than classical correlations should be used for a description 
of measurements of pairs of two observables. This avoids 
conflicts with Bell's inequalities which apply for com- 

plete statistics if the classical correlation is used . 

(4) The unitary time evolution of quantum mechanics is a 
special case of a more general classical evolution which can 
also describe the phenomena of decoherence and syncoher- 
ence. We propose that the unitary time evolution reflects 
the isolation of the subsystem and corresponds to a partial 
fixed point (or better "fixed manifold" ) of the more general 
classical time evolution of the probability distribution. 

A classical statistical description of quantum mechanics 
should not be confounded with a deterministic description. 
The no go theorems for large classes of local deterministic 
"hidden variable theories" remain valid. We rather take 
the attitude that a probabilistic approach is appropriate 
for the basic setting of both the quantum and the classi- 
cal world. Deterministic behavior is a particular case (al- 
beit rather genuine) which can originate from the collective 
properties of many degrees of freedom, as the laws of ther- 
modynamics, or the motion of planets which arc composed 
of many atoms. Other deterministic features arise from 
the discreteness of the spectrum of quantum observables, 
as the energy levels in atoms. The classical statistical en- 
sembles of this paper should therefore not be reduced to 
the concept of a finite large number of point particles for 
which locations and momenta are given, in principle, in a 
deterministic way. 'Classical statistics" means the descrip- 
tion in terms of an ensemble of classical states for which 
observables take fixed values, and a classical probability 
distribution for these states which changes in time by a 
"deterministic" evolution equation. 

We demonstrate in this paper that all features of quan- 
tum mechanics can be described by a classical statisti- 
cal ensemble if appropriate observables for the subsystem 
are selected, if sequences of measurements are described 
by correlations compatible with the subsystem, and if the 
probabilities for the classical states follow a suitable time 
evolution. The strategy of our approach is the following. 
Out of the infinitely many classical observables that can, 
in principle, be measured in the system and its environ- 
ment, we select subclasses of system and quantum observ- 
ables which only measure properties of the system. For 
suitable quantum observables and their measurement cor- 
relations we show that they have all the properties of the 
observables associated to the non-commuting operators in 
quantum mechanics. 

We begin the discussion in sect. [IT] with a simple exam- 
ple of a classical realization for the qubits of a quantum 
computer. While this does not yet reveal all features of 
quantum physics, we present in the following sections a sys- 
tematic discussion how all characteristics of quantum me- 



chanics are realized in classical statistical ensembles with 
appropriate properties. In sect. IIIII we describe the con- 
cept of probabilistic observables for the subsystem and we 
present explicit classical ensembles which realize all aspects 
of two-state and four-state quantum systems in sect. IIVI 
Sect. |V]discusses the correlations between measurements of 
"quantum observables" which can be computed from the 
information contained in the state of the subsystem. In 
sect. |Vl] we turn to the unitary time evolution of the state 
of an isolated subsystem. Sect. I VIII is devoted to a detailed 
discussion of properties of quantum observables within our 
classical statistical setting. In particular, we discuss the 
issue of entanglement and Bell's inequalities in sect. IVIIII 
After a detailed discussion of sequences of measurements 
and their relation to the quantum mechanical commutator 
of two operators in sect. IIX1 we conclude in sect. [X] 



II. CLASSICAL STATISTICAL QUANTUM 
COMPUTER 

In this section we discuss simple examples of classical sta- 
tistical ensembles that realize certain aspects of quantum 
mechanics. The demonstration that all properties of quan- 
tum mechanics can emerge from a classical setting, and 
a more systematic discussion under what circumstances 
this happens, will be given in the following sections. Here 
we describe the operations of a quantum computer purely 
within the framework of a time evolution of probabilities 
for the states of a classical statistical ensemble. 

Qubit 

We start with a two level observable A which can take 
only two values +1 and — 1. Equivalently, we can identify 
A = 1 with the value 1 for a bit, while A = — 1 corre- 
sponds to the value 0. We consider families of statistical 
ensembles, where a particular ensemble is specified by a 
particular probability distribution for the "classical states" 
of the ensemble. For every classical state r the observable 
has one of the values A T = 1 or A T = — 1 , and the prob- 
ability for the state p T obeys < p T < 1 , J2 r p T = 1- 
The time evolution of the system is described by the time 
dependence of the probabilities p T (t), defining a trajectory 
within the family of ensembles. 

In order to compute the expectation value (A) we do 
not need to know the complete probability distribution of 
the ensemble {p T }- It is sufficient to know the probability 
ui+ for A having the value +1, and similarly u>_ for A = 
— 1. Since u>+ + w_ = 1, one of the probabilities w±, or 
the relative ratio of probabilities u>+ / w- , is sufficient to 
compute 

(A) =w+-w— (1) 

In the following we assume that w± can be computed 
in terms of three real numbers pk obeying p\ < 1. In 
other words, the complete statistical information contained 
in the probability distribution {p T } is not necessary for a 
determination of (A) , which only requires the part of the 
information contained in the three numbers pk- This is a 
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typical situation for an isolated subsystem, whose behav- 
ior can be described by the state of the subsystem which is 
specified by (pk), independently of the detailed state of the 
environment. The statistical information which character- 
izes the probability distribution of the ensemble beyond the 
three numbers pk describes the state of the environment. 

We will not need the detailed properties of the environ- 
ment. As a concrete example we may consider a classi- 
cal ensemble with eight states labeled by r = 1,...,8, 
characterized by the probabilities p T > for each state 
(J2 t Pt = !)■ Instead of r we can use a triple- index 
(ci, 02, 03), with <jj — ±1. If we define 

Pj = ^(o'l-^erg), (2) 

C 1 ,0'2 ,G3 

three linear combinations of the probabilities p{o\, 02, 03) 
specify the state of the subsystem, while the remaining four 
independent linear combinations characterize the environ- 
ment. We may use 

p(CTi,cr 2 ,cr 3 ) = p s (ai, 0-2,0-3) +Sp e (oi,02,o 3 ) (3) 

with 

p a ((Ti, o 2 ,o 3 ) = -(1 + Oxpx)(l + cr 2 p 2 )(l + 0-3/03) (4) 
and Sp e obeying 

^ 8pe(oi,02,0 3 ) = , ^ Vj5Pe{oi,02,03) = 0. 

(5) 

Different choices of Sp e differ then only in properties of 
the environment, while pj is independent of Sp e . As far 
as only the properties of the subsystem are concerned all 
8p e obeying the condition ([5]) and < p s + Sp e < 1 are 
equivalent. Typically, the environment may be far more 
extended, with (infinitely) many states r. For example, r 
could be characterized by other labels beyond (01,02,03) 
which do not influence the pj and are summed over in eq. 
^ . We note that in eq. ^ the split between the environ- 
ment and the system cannot be done by simply associating 
some of the "classical bits" Oj to the system and others to 
the environment. 

We will consider a simple dependence w+(pk) given by 

(A) =p 3 , w+= -{I + P3). (6) 

In order to describe the time evolution of (A) we need to 
specify the time evolution of the state of the subsystem. 
Let us investigate rotations 

Pk(t,t') = S k i(t,t') Pl (t') , SS T =1, (7) 

as implemented by an evolution equation 

J^ fc = T MPl , (T) T = -T. (8) 

(Sums over repeated indices are implied.) As a particular 
example we may choose a rotation in a plane diagonal to 



the 1 — 2 and 2 — 3 planes with ip linearly increasing with t 

S= _ (9) 

cos 2 <p , \[2 sin tp cos ip , sin 2 ip \ 
— \[2 sin p cos ip , 1 — 2sin 2 iy9 , \/2 sin p cos (p 
sin 2 tp , —^/2smip cosip , cos 2 <p J 

where 

/ 0, 1, 0\ 
T = V2ip -1, 0, 1 . (10) 

V 0, -1, oj 

With p kfl = p k {p = 0) eq. yields 

(A(t)) = sin 2 <ppi t Q - \/2sin^cos(^p2,o + cos 2 pp3,o- (H) 
For EkPl 

— 1 this describes a typical precession pattern 
of a spin in quantum mechanics in an appropriately cho- 
sen homogeneous magnetic field. While (A) only requires 
the knowledge of P3, the time evolution of P3 also involves 
the two other variables characterizing the state, p\ and p2- 
These variables will contain the information about the cor- 
relations which are characteristic for a quantum system. 

Rotations with arbitrary angles ip can be realized on the 
classical level by a time evolution of the probabilities p T 
according to eq. ([2]). For this purpose it is sufficient that 
p s (oi, 02,03) in eqs. (g]) evolves according to the ro- 
tations of pj. This may be accompanied by an arbitrary 
evolution of Sp e (oi, 02, 03) which ensures that the condi- 
tion ([5]) and < p s + 8p e < 1 continues to hold. The 
simplest example is Sp e = for all t. However, the precise 
time evolution of 5p e has no influence on the evolution of 
Pk{t) and (A(t)). The time evolution of the subsystem is 
decoupled from the environment. As a special example we 
may consider ip(t = A) = 7r/2 with 

P3(t) = Pl,0 , Pl{t) = P3,0 , P2(t) = ~p2,0, (12) 

represented by 

p s (oi,02,o 3 ;t) = p s (o3,o 2 ,o 1 ;0), 
p s (oi,02,o 3 ;t) = p s (o3,-o 2 ,o 1 ;0), (13) 

independently of Sp e (t) or 5p e (0). 

For p2Pk < 1 the three numbers pk can be represented 
by a 2 x 2 hermitean density matrix in terms of the Pauli 
matrices Tk 

P=\{^ + PkT k )- (14) 

In turn, a "pure state density matrix" , which obeys 
PkPk — 1, can be expressed in terms of a normalized 
two-component complex vector ip a - a "quantum state" 
or "qubit" - as 

Pafi = Ipalpp ) C^a = 1, (15) 

such that the expectation value of A obeys the quantum 
rule (A = r 3 ) 

(A) = r a (r3) a ^0 = WA\^). (16) 



4 



Rotations of the vector (pk) act on ip a as unitary transfor- 
mations 



l/ia(t) = U a p(t)ll}()(0). 



(17) 



Inversely, for any Hamiltonian of a two-state quantum 
system we can construct the evolution operator U and the 
associated evolution of pk(t) by using eq. (fl~4"l) and 



p a0 (t) = U a7 (t)p 75 (0)Ujp(t) 



(18) 



With eqs. (3]), we can infer the classical probability dis- 
tributions p T (t) whose time evolution precisely reproduces 
the time evolution of the expectation value of the observ- 
able A, which is associated in the quantum system to the 
operator A — t 3 . For example, we can interprete this ob- 
servable as the third component of the spin in appropriate 
units, A = S3. Simultaneously, we can also describe by eq. 
(fl"6)) , or more generally by 



(A) = tv(pA), 



(19) 



the time evolution of the orthogonal spin components as- 
sociated to the operators T\ and r 2 . 

Classical statistical realization of quantum gates 

The evolution (fT!?]) , (fTB"]) realizes the "Hadamard gate" 
for a qubit, with associated unitary evolution operator 



V2U- 1 



(20) 



(The overall phase of U is without physical significance, 
since the overall phase of drops out in eq. (fT5|) .) One 
could realize the Hadamard transformation by a continuous 
time evolution of a classical ensemble with probabilities 
p T (t) changing according to eq. ([9]), with 



, . nt 



(21) 



such that for a "read out" at t = A the angle ip = ^ realizes 
eq. (|T2|) . Alternatively, we may associate the probabilities 
p T (t = A) with a distribution of random numbers, which 
are obtained from random numbers at t = by eq. (I12p . 

Other quantum gates can be realized similarly. For ex- 
ample, the j -phase shift 



U 



1, 
0, 



(22) 



can be realized by a rotation of (pk) in the 1 — 2 plane 



T 



and 




S 



cos ip, sin tp, 
-shop, cos (/j, 
0, 0, 1 



<P = 



irt 
4A' 



(23) 



(24) 



For t — A this amounts to 



pt(t) = ^=( P1 (0) + P2 (0)), (25) 

P2(t) = -j=(-pi(0)+ P 2(0)), 

Ps(t) = P3 (0). 

We can also construct classical ensembles realizing quan- 
tum gates acting on two qubits, as the "controlled not gate" 
or CNOT-gate represented by 



U 



/l, 0, 0, 

0, 1, 0, 

0, 0, 0, 1 

\0, 0, 1, 



(26) 



For this purpose we enlarge our classical ensemble and con- 
sider a subsystem characterized by 15 real numbers pk 
obeying pkPk < 3. They can be grouped in to a 4 x 4 
density matrix 

9 = j(l + PkLk) , tr(L k Li) = 4S H , (27) 

with Lk appropriately normalized generators of SU (4) 
given in a direct product basis by Q 



Li 
L 4 
L 7 

L13 



r 3 ® 1 , L 2 = 1 r 3 , L 3 = t 3 (g) r 3 , 
1 ® r x , L 5 = 1 ® r 2 , L 6 = T 3 (g) Tx , 
t 3 <g> r 2 , = n ® 1 , Lg = t 2 ® 1, 
ri <g> r 3 , Lu = r 2 <g> r 3 , L12 = n <g> n, 

Tl ® T 2 , L14 = — T2 <g> T 2 , L X 5 = T 2 <g Tl- 



(28) 



Pure states obey p 2 = p. They can be associated to a 
complex four-component vector ip according to eq. (|15p . 
which describes the quantum states of four-state quantum 
mechanics or two qubits. The unitary transformation (|26[) 
is associated to a transformation of the density matrix p 
and the fifteen-component real vector (pk) according to 



pi = UpU^ , p' k = tv(L k p'), 



(29) 



resulting for the density matrix p in the exchange of ele- 
ments 



Pl3 <-»■ Pl4 , P 2 3 /°24 , P31 j»41 , P32 
P33 *-> P44 , P34 *-> P43, 

and for the vector (pk) in 

Pi <-> P3 , P5 <"> P7 , P8 <"> Pl2, 
P9 <"> Pl5 , PlO «"> Pl4 , Pll <-> Pl3 



P4 2 , 



(30) 



(31) 



Eq. (T5T|) corresponds to a particular rotation of the 15- 
component vector (pk)- 

For a realization in terms of classical statistics we may 
employ an ensemble with 2 15 states r. They can be labeled 
as {crfc} = ((Ti, cr 2 , . . . , (T15) with fc = 1, . . . , 15 and ct 2 = 1. 
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We can again use a classical probability distribution of the 
type §5§ with 

PsiWk}) =2- 15 H(l + a kPk ), (32) 

k 



and 



MW) = . E "iMW) = 0. (33) 



Again, one finds 



= E ^(W) 



(34) 



WI 



and the rotations among the p k are realized by an evolution 
of p s with time varying p k , while the evolution of the "en- 
vironmental information" 5p e {<7k} is arbitrary as long as 
the constraints (f3"3"| are obeyed and < p s +5p e < 1. This 
demonstrates that the time evolution of classical probabil- 
ities can realize the CNOT gate, even though the number 
of classical states 2 15 is very high. (We do not attempt 
here to discuss possible classical realizations of the CNOT 
gate with a smaller number of classical states.) 

Also the Hadamard gate and the 7r/4-phase shift can op- 
erate separately on qubit 1 and qubit 2. For qubit 1 we 
replace in eqs. dHJ , ([23: (pi,P2,p3) -> (p8,p9,Pi), while 
for qubit 2 the replacement is {p\,p2-,Pz) — >■ (Pi, P5> P"i)- 
The different gates can operate one after the other if we 
switch the evolution of probabilities after a time step A to 
the evolution characteristic for the next gate. A sequence of 
two gates corresponds to a sequence of two rotations of the 
vector (pk) or the associated classical probabilities p({crk}) ■ 
Obviously, the order of the sequence matters since in gen- 
eral rotations are not commutative. 

Classical statistical realization of a quantum 
computer 

We have now most elements for a description of a quan- 
tum computer in terms of the time evolution of probabil- 
ities for the states of a classical statistical ensemble. Our 
purpose is here purely an existence proof on the conceptual 
side, rather than a proposal for a practical realization which 
could speed up computations for certain types of problems. 
It is meant as an introductory example how classical statis- 
tics can be linked to quantum mechanics. Beyond the op- 
eration of quantum gates we still need a description of the 
initialization of some algorithm, the readout of results of a 
computation and the scaling to a larger number of qubits. 

Initialization means the preparation of an initial state 
for the qubits on which quantum gates can operate con- 
secutively in discrete time steps A. A convenient initial 
state has for all qubits. In our two-qubit example this 
means that the two observables associated to the gener- 
ators L\ and Li should have expectation value —1, i.e. 
pi = p2 = — 1. We note that the generators L% and L2 
commute, L1L2 = £2-^1 = L3. We want to start with a 
pure state which requires pkPk — 3 and therefore choose 
ps = 1. For all other p k vanishing, pk>4 = 0, the density 
matrix 

p = ~(1 - L x - L 2 + L a ) = diag{0, 0, 0, 1) (35) 



describes indeed a pure state, with associated wave func- 
tion ip = (0,0,0, 1). The associated initial classical proba- 
bility distribution is 



p(Wk}) 



5 (l-a 1 )(l- ( 7 2 )(l+ ( 7 3 )+^ e ({ ( 7 fc }). (36) 



For a certain number of time steps A the classical prob- 
abilities p({o"fc}) are assumed to change in a way such that 
a sequence of CNOT, Hadamard or phase shift quantum 
gates are operated on the two qubits. After this sequence 
of gates the classical probability distribution has reached a 
"final value" pfyi&k}) which one wants to "read out" by 
a measurement of the two qubits. The read out is done by 
measuring two two-level observables A^\A^ which can 
take the values ±1. The respective probabilities of finding 
+1 or —1 are related to the expectation values similar to 
eq. (TT]), with 



(A^)=p 1 (t f ), (AW)= P2 (tf), 



(37) 



and pi : 2(tf) determined from p/({ofc}) according to eq. 
(|34p . In other words, the probability for finding for qubit 1 

the value 1 is given by =5(1 + Pi (£/)), an d for qubit 
2bywf =i(l + p 2 (t f )). 

As appropriate for two qubits there are four possible out- 
comes of the read out (1, 1), (1,-1), (—1, 1), and (—1, —1). 
(We use here the notation —1 for the value of the bit, 
according to the values of the two-level-observables A^ k \) 
For a given initial state and a well defined sequence of quan- 
tum gates, one may wonder if it is possible to compute the 
joint probabilities p++,p^ — ,p — \-,p~- for the observables 
A^ and A^ 1 having values according to these four possi- 
bilities. Since p ++ + p_| +p |_ + p~- = 1 this requires 

three independent numbers. The values pi(tf) and pi{tf) 
provide only two of them. They are not enough for a com- 
putation of the joint probabilities that qubit 1 has the value 
+1 and qubit 2 the value —1, etc.. The missing piece of 
information is 



Ci 



E 



cri<J2p(Wk}) =p+++p — -p+--p- + . (38) 



Together with p\ = p ++ + p^ — p |_ — p and P2 = 

p ++ — p^ + p |_ — p this would yield the necessary 

information for the computation of the four joint probabil- 
ities p ++ , p_| , p |_ and p separately. 

In general, the quantity 



C *3 = E a i a jP(i a k}) 



(39) 



depends on Sp e and therefore on the detailed properties of 
the environment. It is then not a property of the subsys- 
tem alone and therefore not available for a prediction of 
"readouts" by subsystem data only. For the initial state 

(j35[) we have p++ = P-\ = p 1- = 0, p = 1 and we can 

identify for t = 



C\2 = P3 = P\Pi- 



(40) 



Eq. ([4*11)1 holds for t / only if the "read-out" state is precisely 
a simultaneous eigenstate of A^ 1 and A^ 2 \ Only for this 



6 



special case we can use the "system data" p% in order to 
predict the joint probabilities, while for general states this 
is not possible. 

We will argue that the joint probabilities p ++ etc. arc 
actually not what is needed for a prediction of the outcome 
of the four possibilities of the readout. The readout of qubit 
1 is a measurement, and in general measurements change 
the ensemble. What is needed is the probability w ++ to 
find for A^ 1 the value +1 if is measured to have the 
value +1. (We will see in sects. [V] |lX]that the order of 
the measurements does not matter in our case.) Thus the 

conditional probabilities w ++ and w.\ for finding A^ 

■ I under the condition A^ = 1 are needed, and similarly 

w |- and for a measurement of A^ if yl' 1 ) = — 1. 

We will see in sect. [V] that the conditional probabilities 
can be expressed by system data and therefore predicted 
without knowing details of the environment. In particular, 
one finds 

w ++ — — to |- + w__ = P3. (41) 

Together with tol = w++ + u>-\ , w+ = + w y, 

which involve p\ and p2, all conditional probabilities can 
indeed be expressed in terms of p\,pi and p%. The proba- 
bilities for the four different readout results can therefore 
be predicted using the subsystem data alone. 

The predictability of probabilities w ++ etc. is crucial for 
performing quantum computations for which the 2^ alter- 
natives associated to the readout of Q quantum bits can be 
used. If quantum computing only involves the state of the 
system independently of the unknown state of the environ- 
ment, the computability of the conditional probabilities in 
terms of pk is a key ingredient. This brings us to the im- 
portant issue which type of correlations should be used for 
measurements of properties of the subsystem. The condi- 
tional probabilities fixed by eq. (|41|) define a "measurement 
correlation" 

(A^A^) m = p 3 . (42) 

This is distinct from the "classical" or "ponintwise" corre- 
lation 

{AW-AW) = C a . (43) 

While the measurement correlation is computable in terms 
if the state of the subsystem, the classical correlation is not, 
since it involves details of the environment. The outcome 
of a "good measurement" in an isolated subsystem should 
only depend on the state of the subsystem and not on the 
state of the environment. This excludes the use of the 
classical correlation. We will discuss this important issue 
for the understanding of quantum systems in more detail 
in sect. IV1 

Finally, we briefly discuss the scaling for our classical 
statistical implementation of a quantum computer for an 
arbitrary number of Q qubits. One can associate Q qubits 
to M-state quantum mechanics with M = 2®. In turn, the 
most general density matrix involves K = M 2 — 1 compo- 
nents of the vector pk- All quantum gates can be realized 



by rotations of (pfe). Generalizing our construction of as- 
sociated classical probabilities p T involves S — 2 K classical 
states. We observe that the number of classical states 

S = 2V~V (44) 

increases very rapidly with Q. Large numbers of classical 
states are characteristic for a classical statistics implemen- 
tation of quantum mechanics. Realizing spin observables 
with an arbitrary direction involves already an infinity of 
classical states [4]. We emphasize, however, that most of 
the information contained in the probability distribution 
for the classical ensemble {p T },i~ = 1...S, only concerns 
properties of the environment. The state of the quantum 
system involves only K real numbers pk or M complex 
numbers ip a in case of pure states. 



III. OBSERVABLES AND EXPECTATION 
VALUES 

The quantum features encountered in the preceding sec- 
tion can be generalized by addressing systematically the 
embedding of a subsystem in a more extended classical 
ensemble that also includes the environment or the vac- 
uum. We discuss in the following sections systems that 
correspond to a finite number of quantum states M. The 
generalization to M — > oo, needed for quantum observables 
with a continuous spectrum, is straightforward, as shown 
for a quantum particle in a potential in ref. [12j |. 

Probabilistic observables 

The most crucial effect of the embedding of the sub- 
system into a classical statistical ensemble, typically for 
infinitely many classical states, is the appearance of proba- 
bilistic observables for the description of the subsystem. 
For a given state of the subsystem - which will be as- 
sociated with a quantum state - they have a probability 
distribution of values rather than a fixed value as for the 
standard classical observables in a classical state. For sub- 
systems that are equivalent to M-state quantum mechanics 
the spectrum of the possible outcomes of measurements for 
the probabilistic observables contains at most M different 
real values 7„. In a given quantum state the probabilistic 
observable is characterized by probabilities w a to find "f a , 
where w a depends on the state. The simplest example are 
two-level observables, which can resolve only one bit, such 
that 7i = 1 , 72 = —1. 

We may use the example of the observable A in the pro- 
ceeding section, where for a given state of the subsystem 
characterized by {pi, p2, Pz) only the probability w+(pk) 
for finding the value A = 1 is known. In fact, the emer- 
gence of probabilistic observables from a reduction of ef- 
fective degrees of freedom is a generic feature in statisti- 
cal physics. As another example, we may group classi- 
cal states r = (er, (3) into collective states a by summing 
over /3, with probabilities for the collective states given by 
Pu = ^2pP(a,0)- Then a classical observable A which has 
a sharp value ±1 for every state (c, (3) will have only a 
probability distribution of values ±1 in the state a, where 
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the probability to find the value +1 in the state a 

obtains by summing the probabilities PUj,ff) over all (3 for 
which Ate^-s = 1. 

Let us consider a subsystem that can be described by 
n discrete classical two-level-observables A^ k \ We assume 
that these observables form a basis in a sense to be speci- 
fied later. The simplest quantum mechanical analogue for 
n — 3 is two-state quantum mechanics, with A^ corre- 
sponding to three orthogonal "spins" in an appropriate 
normalization. This may be viewed as an atom with spin 
one half where only the spin degree of freedom is resolved, 
as for example in Stern-Gerlach type experiments. 

The subsystem is embedded into a classical statistical 
ensemble with infinitely many states labeled by r. On 
this level the observables A^ are standard classical or de- 
terministic observables. They can only take fixed values 
A T k ^ = ±1 for any state r of the classical statistical ensem- 
ble. For arbitrary n, we denote the expectation value or 
average of A^ by pk, 

Pk = = Y,Pt A( t } > -!</*<!• (45) 

T 

The classical probabilities obey p T > for all states r of the 
classical statistical ensemble. As usual, one has ^ T p T = 1 . 
However, not all possible classical probability distributions 
{p T } obeying these conditions correspond to quantum sys- 
tems, and we will discuss restrictions below. 

The probabilistic observables associated to these clas- 
sical observables have probabilities w± = (1 ± p&)/2 to 
find 7i.2 = ±1. We can compute w^t by summing the 

(k) 

probabilities p T over all classical states for which A)- = 1 , 

(k) 

and similarly for w_ . This maps the classical observable, 
as characterized by the values At in every classical state 

(k) 

r, to a probabilistic observable characterized by 7a and 

0) 

Wa ■ 

The map from the classical observables to the proba- 
bilistic observables is not invertiblc. A different classical 
observable A'( k \ with different values A^ in the classi- 
cal states, may be mapped to the same probabilistic ob- 
servable A^ k \ For this it is sufficient that the probabil- 

(k) 

ities w± are the same for every probability distribution 
{p T } which corresponds to a quantum system. This lack of 
invertibility avoids conflicts of our classical statistical de- 
scription with the Kochen-Specker theorem !5j, as we will 

discuss in sect. IVIIl Since the classical observables At 
contain much more information than the associated proba- 
bilistic observables, they "measure" properties of both the 
quantum system and its environment. The transition to 
probabilistic observables "integrates out" the environment 
degrees of freedom 

Quantum system and system observables 

We will assume that the n numbers pk are the only in- 
formation that is needed and available for a computation 
of expectation values for the "system observables" of the 
subsystem. In this sense, the state of the subsystem is 
characterized by the n expectation values of the basis ob- 
servables A( k > . Only a very limited amount of the informa- 



tion contained in the probability distribution {p T } for the 
total system is needed for the subsystem. We will refer to a 
subsystem with these properties as the "quantum system" , 
even though in certain special cases it can also describe a 
classical statistical system. 

For a given quantum system the system observables are 
those classical observables that lead to probabilistic ob- 
servables A for which the probabilities w a can be com- 
puted in terms of {pk}- Then the expectation values of 
functions f(A) can also be computed from {pk}, (f(A)) = 
yV f{"~ia)wa{pk)- We will assume that the relation between 
w a and pk is linear. 

Our first question concerns a classification of possible 
system observables for the subsystem. It is straightfor- 
ward to define rescaled observables cA^ by (cA") T = 
= cp k - Furthermore, we can trivially shift 
the observable by a piece eo proportional to the unit ob- 
servable. The rescaled and shifted observables A obey 

{A)= Pk e{ A) +ei A \ (46) 

where repeated indices are summed. Here we associate to 
each A^ an n-dimensional unit vector e*- fc ' with compo- 
nents e'm = <5^j. Then the vector reads e^' = ceW if 
A = cAW. One may use c = H/2 if A describes a spin with 
standard units of angular momentum. Other units may 
be employed for alternative interpretations, as for example 
occupation number n = (1 + A^)/2 which equals one for 
occupied and zero for empty. (Contrary to widespread be- 
lief, the value of h is not a genuine property of quantum 
mechanics, but rather an issue of units.) One could include 

into ann+ 1-dimensional vector e^ A \ We will not do 
so since in the following we mainly discuss observables with 

4*> = o. 

For an arbitrary system observable A we may write the 
expectation value 

(A) =J2laW a (Pk), (47) 
a 

with w a depending linearly on pk, as a linear combina- 
tion (A) — y2k Ck (^ k ^) ■ (If necessary, we substract an 
appropriate constant shift.) This should hold for all prob- 
ability distributions {p T } which describe the quantum sys- 
tem. We can associate to each such observable the vector 
e ( A ) — J2 k Cke^ , = Cfc, such that eq. (|4U|) remains 
valid. Thus the vector can be used for a partial char- 
acterization of a system observable. Explicit constructions 
of such probabilistic observables can be found in Q , where 
observables of this type correspond to rotated spins in the 
quantum mechanical analogue. The possibility to write 
the expectation value in the form is a necessary, al- 
beit not sufficient condition for a classical observable to be 
a system observable. Beyond the vector a system ob- 
servable also needs the specification of the explicit form of 
w a {pk)- 

Probabilistic observables are characterized by the spec- 
trum {7a} of possible measurement values, and the associ- 
ated probabilities w a ■ The multiplication of the observable 
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by a constant c and the addition of a piece proportional to 
the unit observable are always defined by j a — > C7 Q + e . 
However, the sum and the product of two probabilistic ob- 
servables A, B are not defined a priori. At this stage we 
can only give a necessary condition that a possible lin- 
ear combination C of two observables A and B can be 
defined as a system observable: it has to be represented 
by linear combinations of the associated vectors e. If 
C = c A A + c B B, one should have e (c) = c^e^ + c B e (B) , 
such that (C) = Ca(A) + cb(B) can again be computed 
from eq. PS|) . 

Incomplete statistics 

In general, the joint probabilities for pairs of two ob- 
servables A^ and A® cannot be computed from the in- 
formation which specifies the state of the subsystem. In- 
deed, for each of the n(n — l)/2 pairs of unequal basis 
observables one would have to specify the probabilities 

p_i_+,p_i ,p |- and p that they have simultaneously the 

values A^ = +1,A® = +1, or A<» = +l,A® = -1 

etc. Since p ++ +p-\ +p ^ +P— = 1, this requires three 

numbers for each pair, or a total of 3n(n — l)/2 numbers. 
This is more than the n numbers pk which characterize the 
state of the subsystem. 

The joint probabilities are available only at the level 
of the classical statistical ensemble that characterizes the 
subsystem and its environment. We may pick a particu- 
lar classical representation for the probabilistic observable 
A^\ as well as for A (2 \ specified by A T 1] and A { T 2) . The 
joint probability p++ obtains then by summing the classi- 
cal probabilities p T over all states r for which both A T ^ = 1 

(2) 

and A T ' = 1. However, the joint probability p++ involves 
properties of the environment and is not a property of the 
subsystem alone. Two different classical observables A^ 
and A'r X \ which are mapped to the same probabilistic ob- 
servable A^ for the subsystem, will lead to different re- 
sults for p++. Since Ap and At 1 ^ differ only by prop- 
erties of the environment, this clearly demonstrates that 
joint probabilities involve precise knowledge of the state of 
the environment. 

From the point of view of the subsystem this "environ- 
ment information" is no longer available - it has been "in- 
tegrated out" by the coarse graining of the information. 
We refer to a probabilistic setting for which joint proba- 
bilities are not available or not used for the correlation of 
measurements as "incomplete statistics" . Hence the sub- 
system and the associated quantum mechanics is described 
by incomplete statistics in the sense of ref. jl3| ]. 

Purity 

Next we are interested in some general properties of the 
basis observables. For example, one typical question may 
ask if two of them can have simultaneously a sharp value. 
A "classical eigenstate" of a probabilistic observable A is an 
ensemble for which A has a "sharp value" with vanishing 
dispersion, (A 2 ) — (A) 2 — 0. For example, the eigenstate of 
the observable A^ with "classical eigenvalue" (A^ k ') = 1 

is characterized by p T — whenever A T = — 1. The 
maximal number of sharp "basis observables" A^ can be 



characterized by the "purity" P of the ensemble, 

P = PkPk- (48) 

(Note that P depends on the set of basis observables that 
characterize the subsystem.) For P = one finds equipar- 
tition with (A) = for all two level observables. Obviously, 
M sharp observables require P > M, since at least for M 
values of k one needs pk = ± 1 . For an ensemble with P = 1 
at most one observable A^ can be sharp. Typical classi- 
cal ensembles that describe isolated quantum systems will 
have a maximal purity smaller than n, such that not all 
can have sharp values simultaneously. We recall that 
the purity (|48[) is a statistical property involving expecta- 
tion values. For a given classical state r all observables 
A^ have a sharp value. 

More precisely, the purity is a statistical property of the 
subsystem, with a conceptual status somewhat similar to 
the entropy. It is a measure for the size of the fluctuations 
- systems with larger fluctuations have a smaller purity. A 
measure for the fluctuations of the basis observables for the 
subsystem is given by 

G k = ((A^-(A^)) 2 ) 

= ((A«) 2 ) - (A^) 2 = 1 - p 2 , (49) 

with < Gk < 1. The purity can then be expressed by the 
"measure of total fluctuations" G = J^k *-*k as 

n 

P = n-G = n-^G k . (50) 
fe=l 

For maximal fluctuations one has Gk = 1, G = n and 
therefore P = 0, whereas minimal fluctuations with Gk — 
, G = lead to the maximal possible value of the purity 
P = n. 

Density matrix 

For M an integer obeying M > P + 1 we may represent 
the pk by an M x M hermitean "density matrix" p a j3'- 

P=jj(l + PkL k )- (51) 

The matrices L k 3 are Sf7(M)-generators k = 1..M 2 - 1, 
obeying 

tvL k =0,^ = 1, tr(L fc L,) = M8 kU (52) 
{Lk.Li} — 25ki + 2dki m L m , [Lk, L{\ — 2ifki m L m - 

The matrix p has properties of a density matrix in quantum 
mechanics, 

trp= 1 , trp 2 = jj(l + P) < 1- (53) 

Furthermore, we require that p is a positive matrix in the 
sense that all its eigenvalues are positive or zero. This 
implies the positivity of all diagonal elements, 

p aa > 0. (54) 
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For a given M the positivity of p imposes constraints 
on the allowed values pk for which the subsystem can de- 
scribe M-state quantum mechanics. We will discuss these 
"positivity conditions" in more detail below. We note that 
the condition for p being a density matrix can always be 
realized for M = n+ 1 by choosing only diagonal and there- 
fore mutually commuting L k . We are interested, however, 
in the minimal M for which p is positive and the "pu- 
rity constraint" P < M — 1 holds. For a typical value 
M = P + 1 = \fn + 1 not all Lk can commute. 

In analogy to quantum mechanics, a "classical pure 
state" obeys p 2 = p and therefore requires P — M — 1. 
The density matrix can be diagonalized by a unitary trans- 
formation. In consequence, any pure state density matrix 
can be written in the form p = U p m W for a suitable U, 
with UU' = 1. Here {p m )ap — 5 m aSmp obeys trivially 
Pm = Pm-, and this property is preserved by unitary trans- 
formations. The positivity condition is obeyed obviously 
for all pure state density matrices, since the eigenvalues 
are one or zero. 

Quantum operators 

We can associate to any system observable A an operator 
A such that the quantum mechanical rule for the compu- 
tation of expectation values holds (e/. = eL ) 

A = e k L k , (A) = p k e k = tr (pi). (55) 

We will concentrate on the minimal M needed for a given 
maximal purity of the ensemble and assume that a suit- 
able positivity constraint for the p k holds such that p is 
a positive matrix. For n — M 2 — 1 the operators for the 
basis variables A^ are given by the full set of all SU (M) 
generators Lf~. If n < M 2 — 1 only part of the L k are used 
as a basis for the observables. 

Many characteristic features of the system observables A 
can now be inferred from standard quantum mechanics, as 
demonstrated by a few examples. For M = 2 at most one 
of the three possible two-level-observables A^ can have a 
sharp value. This occurs for an ensemble where the den- 
sity matrix describes a quantum mechanical pure state, 
p = |(1 ± A), trp 2 = i + ±tri 2 = 1, with i« = Lk = r k . 
For such an ensemble the expectation value of the two or- 
thogonal two-level-observables must vanish, = for 
I =/= k. Thus, whenever one basis observable is sharp, the 
two others have maximal uncertainty, as for the spin one- 
half system in quantum mechanics. 

Another example for M = 4 describes two different two- 
level-observables (say the ^-direction of two spins Sl,S 2 ) 
by Li = diag (1,1,-1,-1) and L 2 = diag (1,-1,1,-1). 
We may also consider a combined observable for measure- 
ments of the two spins S\ and S 2 . It has the value +1 
whenever the signs of the two spins are found to be equal, 
and —1 if they are opposite. This observable is represented 
by L% = diag (1,-1,-1,1). Consider an ensemble charac- 
terized by P3 = — 1 , pi = pi — 0. For this ensemble 
one has (A^) = (A^) = such that for both two-level- 
observables the values +1 and —1 are randomly distributed 
in the ensemble. Nevertheless, (A^) = — 1 indicates that 
the two "spins" are maximally anticorrelated. Whenever 



the first spin takes the value +1, the second one necessarily 
assumes —1 and vice versa. In sect. IVIIII we discuss that 
pure states of this type show the characteristics of an entan- 
gled quantum state. Our third example considers the ob- 
servable S corresponding to the sum S = L\ +L2. For the 
particular pure state density matrices (p m )a0 = 8 ma 6 m p 
one has (S) = 2 (for m = 1), (S) = (for m = 2,3) and 
(S) = —2 (for m — 4). Thus S has the properties of a total 
spin, composed of two half integer spins (say S z = Sl + S 2 ). 

Pure states and wave function 

The general form p = U p m U^ allows us to "take the 
root" of a pure state density matrix by introducing the 
quantum mechanical wave function ip a as an M-component 
complex normalized vector, i/j'i/j = 1, 

Pa/3 = Tpalpp , Ipa = U a f}{lp m )f) , 
$m)f> = <W . <^> = ^M- (56) 

All the usual rules for expectation values in quantum me- 
chanical pure states apply. 

Pure states play a special role since they describe classi- 
cal ensembles with minimal uncertainty for a given integer 
M. For M = 4 a pure state has purity P = 3 and three 
different observables can have sharp values, correspond- 
ing to the maximum number of three commuting quan- 
tum mechanical operators. For M > 2, P = M — 1 and 
{L k , L{\ = 2S k i + 2d k i m L nll the condition for a pure state, 
p 2 = p or pk[pidkim — [M — 2)#fc m ] = 0, is not automatically 
obeyed for all p k with p k p k = P. Pure states have to obey 
additional restrictions. 

For a pure state, the "copurity" 

C = tr[{p 2 p) 2 ] (57) 

must also vanish. While the purity P is conserved by all 
orthogonal SO(n) transformations of the vector (p k ), pure 
states are transformed into pure states only by the sub- 
group of SU(M) transformations. The SU(M) transfor- 
mations are realized as unitary transformations of the wave 
function -0, where the overall phase of if) remains unobserv- 
able since it does not affect p in eq. ((56)) . In our classical 
statistical description of quantum phenomena, the partic- 
ular role of the classical pure states constitutes the basic 
origin for the unitary transformations in quantum mechan- 
ics. Just as in quantum mechanics, we can write the density 
matrix p for an arbitrary ensemble as a linear combination 
of appropriate pure state density matrices. 

The pure states form the manifold SU{M) /SU(M-1) x 
17(1) |3|. This homo geneous space forms a submanifold 
of S ~ 2 ■ In other words, the copurity constraint im- 
poses on the M 2 — 2 independent real numbers p k = 
fkfk = M — 1, a number of M(M — 2) additional con- 
straints, corresponding to the 2(M — l)-dimensional space 
SU(M)/SU(M - 1) x U(l). A simple way to realize these 
constraints is the condition that the allowed fk have to 
obey 

fk = ¥L k il> , tfl> = 1 (58) 
for some arbitrary normalized wave function if) Q . 
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A linear combination of two complex wave functions 

ip = ci?A (1) + c 2 ip {2) (59) 

defines again a possible pure state if the coefficients ci and 
C2 obey the restriction that ip is normalized, ip'ip = 1. 
There exists a pure state density matrix (|56p for every pos- 
sible pure state. A classical ensemble, for which the prob- 
ability distributions can realize arbitrary density matrices, 
can therefore also realize arbitrary (normalized) superpo- 
sitions of two wave functions. This allows us to describe 
all the interference phenomena characteristic for quantum 
mechanics in terms of appropriate classical ensembles Q. 

Mixed states 

A mixed quantum state has purity P < M — 1. For ex- 

1/2 

ample, the elements pk may obey pk = (P/(M — 1)) ' 
The general positivity condition for a mixed quantum state 
can be formulated by using the possibility to diagonalize 
any hermitean matrix p by a unitary transformation 

p = Udiag( Pa )U ji . (60) 

In addition to P < M — 1 we require 

p a > 0. (61) 

In other words, classical statistical ensembles can describe 
M-state quantum systems, if the expectation values pk 
obey the purity constraint 

Y^PkPk = P = Mj^Pl - 1 < M - 1, (62) 

k a 

and further obey the positivity constraint (f60"|) . ([61]) . We 
will often refer to the combination of eqs. (fSTj) . (|60[) - (f^S|) 
collectively as the "purity constraint" , as mentioned in the 
abstract. Actually, the purity constraint (162|) follows from 
the positivity (|61|) and the normalization ^ Q p a = 1 - the 
generalized purity constraint (|60p - (|62[) is therefore equiv- 
alent to the positivity constraint for the density matrix. 
The use of the wording "purity constraint" is motivated 
by the basic observation that typical quantum systems are 
characterized by a lower bound for the fluctuations and 
therefore an upper bound for the purity. The positivity of 
p (|61[) will be crucial for many steps in our construction. 
For example, together with the normalization of p it guar- 
antees the important inequality for the diagonal elements 
of p 

< Paa < 1 (63) 

which follows directly from the properties of the unitary 
matrices in eq. (|60j) 

p aa =5> /3 |C/ ct/3 | 2 . (64) 



Probabilistic quantum observables 

The vector e^ A ' is sufficient for a determination of the 
expectation value of a system observable A in any state of 



the subsystem (characterized by pk). However, the typical 
observables for the subsystem are probabilistic observables 
and we further have to specify the probability distribution 
for the possible outcomes of measurements for every state 
{pk}- The probabilities w a (pk) are needed for a computa- 
tion of expectation values (A p ) of powers of A. Inversely, 
knowledge of all (A p )(pk) allows the reconstruction of the 
probability distribution w a (pk). We will next concentrate 
on "quantum observables". They constitute a subclass of 
the system observables for which the operators associated 
to A p are given by A p . Since a quantum operator A is 
uniquely characterized by the vector e^ A \ we can also ex- 
press A p in terms of e^ A \ Thus for a quantum observable 
the vector is sufficient for the computation of {A p ). It 
specifies the quantum observable uniquely. 

Not all system observables are quantum observables. As 
an example for a system observable that is not a quantum 
observable we may take the random two level observable 
R with spectrum 7 Q = ±1 and w+ = ui_ = 1/2 for every 
state (pk) of the subsystem. It obeys (R) = , (R 2 ) = 1 for 
all pk- From (i?) = we infer =0, R = 0. However, 
(R 2 ) differs from tr(pR 2 ) = 0. The operator associated to 
R 2 is the unit operator and not the zero operator. We infer 
that for p > 2 the property 

(A p ) = tr( P A p ) (65) 

is not generic for all system observables. We can therefore 
define in a quantum system probabilistic observables for 
which all expectation values (A p ) are computable in terms 
of the state (pk), while the operator relation (f65|) does not 
hold. They only play a minor role in quantum systems 
and we will concentrate in the following on the subclass of 
quantum observables for which the relation (|6"5"]) holds. 

For a given M we consider observables with a spec- 
trum of at most M different values j a - We first con- 
centrate on a non-degenerate spectrum of M different j a 
and identify a = a. The probabilities to find 7 Q in 
the state pk of the subsystem are denoted by w a (pk) > 

) E a ffl o(Pi=) = 1 > ( A ) = J2a W a(Pk)l a = Pk^k- 

The expectation value (A) = tr(p^4) is invariant under 
a change of basis by unitary transformations, p — * p' = 
UpW , A — > A' = UAUK We may choose a basis with 
diagonal A' = diag{\ 1 , . . . , \ M ) , (A) = J2 a Paa^a, sug- 
gesting that the spectrum "f a can be identified with the 
eigenvalues A Q of the operator A, and w a (pk) = p' aa - 

Among the system observables we therefore consider the 
subclass of quantum observables the spectrum {"j a } and 
probabilities 

w a (pk) = p' aa = {U A pU\) aa . (66) 

The purity constraint for pk guarantees < w a (pk) < 1. 
For a given observable A the unitary matrix Ua is fixed. 
Thus A is characterized by "f a and Ua, with expectation 
value 

(A) = w *(Pkh a = tr(pi). (67) 
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To each quantum observable, we can associate a hermitean 
quantum operator, with a spectrum of possible measure- 
ment values given by the eigenvalues A Q = 7 a of the op- 
erator. The classical probability for the outcome of the 
measurement in a given state is the corresponding diagonal 
element of the density matrix in a basis where A is diag- 
onal. We describe the mathematical structures related to 
quantum observables in more detail in sect. IVII1 where we 
also deal with degenerate spectra of less than M different 
7 a . There we will construct an explicit classical realization 
for the quantum observables. 

We can now discuss powers of the probabilistic observ- 
able A, (A p ) = Y,a w <*{Pk)ll- The observable Ap should 
belong to the observables of the subsystem, since it can 
be associated with p measurements of A, multiplying the p 
measurement results that must be identical. We can there- 
fore associate an operator A p to the observable A p , (A p ) = 
ti(pAp). For quantum observables this is realized by A p — 
Ap and we conclude (A p ) = tr (pA p ). For (A 2 ) =tr(pi 2 ) 
a classical eigenstate of A obeys [tr(pA)] 2 =tv(pA 2 ). The 
possible classical eigenvalues are the eigenvalues of the op- 
erator A. If a pure state is an eigenstate of A one has 
Aip = \ip with A = A a = 7 a one of the eigenvalues of A. 

Classical quantum observables 

Consider now the classical ensemble of states r which 
describe the subsystem together with its environment. It 
is straightforward to characterize the properties of the clas- 
sical observables A T that are mapped to the probabilistic 
quantum observables and the corresponding quantum op- 
erators A. First, the spectrum of possible outcomes of 
individual measurements equals the (sharp) values of the 
classical observable A T in the classical states r. It consists 
of the eigenvalues of A. Second, for all allowed probability 
distributions {p T } for the states r it must be possible to 
express w a as a linear combination of pk = {A^ k >) accord- 
ing to eq. (|66j) . More precisely, these statements have to 
hold for all states obeying eqs. f60|). (|6Tj) with P < M — 1, 
Classical observables A T with these properties are called 
classical quantum observables. 

At first sight the second requirement for a classical ob- 
servable to be a quantum observable may seem rather spe- 
cial. However, in many circumstances the quantum ob- 
servables are related to the basis observables by simple 
"physical operations" . For example, the spin observable 
in an arbitrary direction obtains from the three basis ob- 
servables A^ k \M = 2) by rotation, such that the relation 
(|66|) arises naturally [J]. Other simple operations are the 
addition and multiplication of "commuting observables" , 
as we will explain in sect. IVII1 At this point it may be 
worthwhile to pause. We have selected a set of classical 
observables with a remarkable property: whenever the en- 
semble obeys a simple purity constraint for the expectation 
values of some basis observables, all the laws of quantum 
mechanics apply for these classical observables, as for ex- 
ample Heisenberg's uncertainty relation based on the com- 
mutator of the associated operators. 

In summary, we define the quantum observables as a sub- 
class of the classical observables with a spectrum of at most 



M different possible measurement values 7 Q . Furthermore, 
the associated probabilities w a should be given by the rela- 
tion (|66|) which is linear in p)., with coefficients depending 
only on the observable. This implies the relation (Ti5|) and 

defines ej^ 4 ' . (We may add a piece proportional to the unit 
observable.) It is then also possible to compute the expec- 
tation values (A p ) in terms of the information contained 
in the quantum state through an expression linear in the 
Pk- Therefore (A p ) does not involve any properties of the 
environment - these expectation values are completely de- 
termined by the subsystem. To any quantum observable we 
can associate a unique vector (efc) and therefore a unique 
quantum operator. In turn, to each hermitean quantum 
operator we can also associate a unique probabilistic quan- 
tum observable, with w a given by eq. (|66p . On the level 
of probabilistic observables the quantum observables can 
therefore be fully characterized by the vector (e^). The 
map from the classical quantum observables to the vectors 
is not invertible, however. 

Quantum product of quantum observables 

The correspondence between probabilistic observables 
and hermitean quantum operators allows for the introduc- 
tion of a symmetric "quantum product" (AB) S — (BA) S 
between two probabilistic quantum observables, which is 
associated to the operator product \{A, B}. Similarly, 
we can now define linear combinations of quantum observ- 
ables. The sum A + B is the quantum observable which is 
associated to the sum of operators A + B. Together with 
linear combinations, the quantum product (AB) S defines 
an algebra for the probabilistic quantum observables. The 
algebra of probabilistic quantum observables is isomorphic 
to the algebra of quantum operators with product \{A, B}. 
For both, it can be expressed as operations in the space of 
(efc). In particular, we observe A 2 — (AA) S . We will see 
in sect. |V]that a particular use of the "quantum product" 
(AB) S arises from an investigation of conditional probabil- 
ities. 

In general, a probabilistic quantum observable A de- 
scribes an equivalence class of classical observables that 
all lead to the same quantum operator A. The product 
t;{A, B} then induces a product structure between equiv- 
alence classes. If a representative for each class is selected 
one can also define (AB) S on the level of classical observ- 
ables [13;]. We emphasize that the product (AB) S is not the 
classical or pointwise product A B where (A-B) T = A T B T . 

Statistical interpretation of quantum mechanics 

We obtain the laws of quantum mechanics from a classi- 
cal statistical ensemble. Therefore the postulates concern- 
ing the relation between the quantum calculus and sta- 
tistical statements about the outcome of observations can 
be derived from the probability concept for a classical sta- 
tistical ensemble. Consider Born's rule for a pure state 
wave function tp a . Let us assume that the basis vectors 
( 1 Pm)f3 = &mf} denote eigenstates of an observable A with 
eigenvalues j m , i.e. Aip m = r y m 4'm- We further assume 
that the spectrum is non-degenerate. Then Born's rule 
states that the probability for finding in a measurement 
of A the value 7 Q is given by the absolute square of the 
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corresponding component of the wave function, w a = \ipa\- 
This statement is one of the axioms of quantum me- 
chanics. It can be easily derived for our implementation of 
quantum systems as classical statistical ensembles. Since 
A = diag( , y a ) is diagonal in the chosen basis we find for the 
quantum observable associated to A that the expectation 
values of arbitrary powers of A obey 



<^)=tr(iV) = E(^)^— 



(68) 



The classical statistical rule for expectation values then 
implies 

w a =p aa . (69) 

In turn, for a pure state density matrix the diagonal ele- 
ments obey 

(70) 



Pac 



thus establishing Born's rule. The statistical interpretation 
of a mixed state density matrix follows along the same lines, 
since eqs. (|68|) . (|69|) do not need the assumption of a pure 
state. The basic ingredients are the statistical interpreta- 
tion of (A p ) as an expectation value of an observable in 
a classical statistical ensemble, and the relation for quan- 
tum observables (A p ) = tc(A p p). Born's rule follows for all 
classical quantum observables. The generalization for the 
case of a degenerate spectrum implies sums over all a for 
which the eigenvalue in the corresponding basis state has 
the same value j a . 

In summary of this section we have considered subsys- 
tems of a classical statistical ensemble whose state is char- 
acterized by the expectation values pk of basis observables. 
If the pk obey a purity constraint they define a quantum 
density matrix p (15ip . The positivity of p is equivalent to 
the purity constraint. We have discussed system observ- 
ables for which (A p ) is computable in terms of (pk), and 
concentrated on the subclass of quantum observables for 
which eq. (|65|) holds. For quantum observables all the 
rules of quantum mechanics for the outcome of measure- 
ments, as Born's rule or Heisenberg's uncertainty relation, 
can be inferred from the probabilities of the classical sta- 
tistical ensemble. 



IV. SIMPLE QUANTUM SYSTEMS 

In this section we present explicit examples which 
demonstrate the general considerations of the preceding 
section. We discuss classical ensembles that realize all as- 
pects of quantum-operators and their expectation values 
for two-state and four-state quantum mechanics. These en- 
sembles involve infinitely many classical states, while the 
subsystems are characterized by only a few real numbers. 
We give explicitly the classical observables that are repre- 
sented by quantum operators and comment on the asso- 
ciated equivalence classes of classical observables that are 
mapped to the same probabilistic observables and quan- 
tum operators. The issue of correlations for measurements 



of pairs of such observables will be addressed in the next 
section. 

Two-state quantum mechanics 

As an instructive example we may consider a classical 
ensemble corresponding to two-state quantum mechanics 
For this purpose we generalize the setting of sec- 
tion |TT] by considering spins in arbitrary directions. In 
sect. [II] the three cartesian spins A^\ A^ 2 \ A^ are re- 
alized on the classical level by the observables a\ , cr% , 03 • 
We now denote the direction of the spin by a unit vector 
(e%, e%, 63), efeefe = 1, such that A{e k ) is again a two level 
observable, A 2 (e k ) = 1, and = A(1,0,0) , A^ = 
A(0, 1,0) , A<- 3 > = A(0,0,1). Next we choose for every 
angular direction on S 2 /Z2 (denoted now by g k ) a dis- 
crete variable 7(<?fc) — ±1 , 9k9k — 1, with special cases 
a x = 7(1,0,0) , o 2 = 7(0,1,0) , ct 3 = 7(0,0,1) treated 
in sect. [TTJ Here it is sufficient to have a variable 7 for 
every point on the half-sphere S 2 /Z2 since we can identify 
7(~<?fc) — — lidk)- We may consider the infinitely many 
classical bits "f(gk) as the limit of a sequence where an in- 
creasing number of points on S 2 /Z2 is included. The clas- 
sical probability distribution depends now on sequences of 
an infinite number of classical bits "f(gt) , t — {7(5*1)} , 

Pt = p({7(3fc)})- We again use the form (3) 

p(H9k)}) =Ps({l(g k )}) +5p e ({j(g k )}) (71) 



with 



and 



ps({i(9k)}) =n 



[l + Pkgki(gk)) 



(72) 



E P.({7(»)})=1, E =11 E ( 73 ) 

The classical representation of the spin in the direction 
efe involves precisely one of the discrete variables 

A(e fc )=A(e fc )({ 7 (. 9fc )}) =7(e fe ). (74) 

In other words, the observable A(ek) "reads out" the clas- 
sical bit with g k = ±e k - Its expectation value is given 
by 

(A(e k ))= E li.^)p([l{gk)}) = p k e k , (75) 
{7(3*)} 

where we assume as in sect. [TT] 

E *P* = °> E 7(e fc )5p e = 0. (76) 

(7(Sfc)} {7(5*;)} 

Correspondingly, the probability to find in the ensemble 
the value +1 for the observable A(ek) reads 

w+{e k ) = \{l + (A(e k ))) = i(l + Pk e k ). (77) 
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It can be expressed in terms of the vector (p k ) which char- 
acterizes the state of the subsystem and corresponds to the 
expectation values of the three basis observables 

pi = <A(l,0,0)) , pa = (4(0,1,0)) , P3 = (4(0,0,1)). 

(78) 

The probabilities w+(e k ) (with w^(e k ) = 1 — io+(e^)) char- 
acterize the probabilistic two-level observables A(ek)- For 
P = PkPk < 1 this classical statistical ensemble realizes 
two-state quantum mechanics. 

Four-state quantum mechanics 

We next generalize the ensemble in order to describe 
four-state quantum mechanics. As long as only two-level 
observables are concerned this is straightforward. Instead 
of three basis observables we now consider 15 two-level ba- 
sis observables A^ m%> , m = 1 ... 15, whose expectation val- 
ues characterize the quantum state 

(4(™)) = (4(4 m) ))=p m . (79) 

The associated quantum operators are the fifteen SU(4) 
generators L given in eq. l(28|). The directions of general 
two-level observables can again be associated with a fifteen- 
component vector (efc), with basis vectors = <5™. We 
impose the restrictions 

e k e k = 1 , eke m dkmi = 0, (80) 

with dkmi = d m ki the symmetric "structure constants" in 
eq. ([52)1 which obey for every k dkki — 0. The manifold 
parameterized by (e k ) with the condition (|80[) is the ho- 
mogenous space SU(4)/SU(3) x U(l). 

For every point g k of SU(4)/SU(3) x [7(1) x Z 2 we 
introduce again a discrete variable y(g~k) , 7 2 ($fc) = lj 
l(~9k) = — l(9k)- The split of an arbitrary classical proba- 
bility distribution describing an isolated subsystem is done 
similarly as before 

p(W$k)},t) =Ps({l(9k)})p s (0+S Pe {{ 1 (g k )},c) 

(81) 

with £ a further collective variable characterizing addi- 
tional degrees of freedom of the classical states. With p s 
given again by eq. ([73]), £ c p 8 (C) = 1, 

^<5p e = 0, 5>( e *)*« = °> (82) 

r r 

we find for all two level observables A(ek) 

(A(e k )) = Pk e k = tr(pi). (83) 

The last part holds for P = p k p k < 3 and p the associ- 
ated density matrix (|51|) . provided we represent A(e k ) by 
the operator A = e k L k . We observe that A 2 = 1 holds 
only if the condition is obeyed. This condition states 
that instead of arbitrary SO(15) - rotations of the basis 
observables A^ we only consider SU(4) transformations 
thereof. 

Four state quantum mechanics admits probabilistic ob- 
servables with a spectrum with up to four different values 



7 a . The associated classical statistical ensemble should de- 
scribe this situation. In particular, the part of the classical 
probability distribution {p T } which describes the subsys- 
tem should contain the information about the probabilities 
w a (pk) to find the measurement values 7 a for observables 
with three or four different values j a . This is the role of 
p s (£) in eq. (fSTj) . The description of observables with three 
or four different values in the spectrum will be associated 
to quantum operators A = e k L k with values of (e k ) not 
obeying the second restriction in eq. ([50]) . 

We can restrict the discussion to observables for which 
the spectrum (71,72,73,74) obeys 

5>a=0, E^= 4 - ( 84 ) 

a a 

Observables with a different normalization can be obtained 
by multiplicative scaling A — > XA, and the first condition 
in eq. ([84]) can always be achieved by subtracting an appro- 
priate part proportional to the unit observable. If the state 
of the subsystem can be described by (p k ) with p k p k < 3 
and therefore by an associated density matrix p, and if the 
probabilistic observable A can be associated to a quantum 
operator A such that (A) = tv(pA), the conditions 
read 

tri = , tri 2 = 4. (85) 

This is obeyed for A = e k L k if (e^) is a unit vector on 
S 15 ,e k e k = 1. The conditions lj53]l fix 73 and 74 as a 
function of 71 and 72 , such that the spectrum of normal- 
ized observables can be characterized by two parameters 
(71,72), while 

73 + 74 = -(71 +72), 
73 + 74 = 4 - 7i 2 - 7 2 2 , 

73 - 74 = ± V /8 - 37 1 2 - 37 2 2 - 2 7l 7 2 . (86) 

This determines 73 and 74 uniquely up to an exchange 
73 74- We observe the restriction 

7i 2 +7 2 2 + ^(7i+7 2 ) 2 <4. (87) 

The two level observables correspond to the special case 
where 71 = ±1 , 72 = ±1. They obey the second condition 
((80| for (ek), which defines a submanifold of S 15 . This sub- 
manifold is the homogeneous space SU(A)/SU(2) x SU(2), 
as can be seen by the following argument. We can charac- 
terize the (e k ) obeying eq. (fSTj)) by the properties of the as- 
sociated operator A = e k L k as tr4 = , A 2 = 1 , A* = A. 
The spectrum of a hermitean matrix remains conserved by 
unitary SU(4) transformations A — > UAU^. On the other 
hand, for any pair (A, B) of hermitean matrices with iden- 
tical spectrum there exists a unitary transformation such 
that B — UAU^. (The spectrum defines the orbits of the 
SU(4) transformations.) We can therefore obtain A by a 
unitary transformation from the generator L\, A = U LiUK 
Since L\ is invariant under the subgroup SU(2) x SU(2) 
we can use the transformations in SU(4)/SU(2) x SU(2) 
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in order to describe the two-level operators A and the as- 
sociated (e k ). 

Let us next specify a classical statistical ensemble that 
can describe four-state quantum mechanics. Consider unit 
vectors (g k ) on S 15 , g k gk = 1 , k = 1 . . . 15. To every 
g k we associate a discrete label a(g k ) which can take the 
value a(gk) = 1,2,3,4. The classical states are sequences 
of these discrete variables 

r={«(gk)}, E=n e ( 8g ) 

t Sfc a(g k ) = l 

This construction generalizes the one for M = 2: instead of 
(g k ) € 5 3 we now have (g k ) € S* 15 , and the discrete variable 
■j(gk), which can take two values for M = 2, is replaced by 
a(gk) which can assume four discrete values. The classical 
probability distribution is again written in the form 

Pr = {Ps)t + (Sp e )r, 

{Ps)t = Y[w a (p k ,g k ), (89) 

9k 

which (Sp e ) T characterizes the environment and (p s ) T the 
system. The probabilities w a > obey ^ Q w a = 1 and are 
defined as follows. We diagonalize the hermitean matrix 
G = gkLk by a unitary transformation 

G = g k L k , G = U{g k )GU\g k ) = diag(\ a (g k )), (90) 

where U(g k ) is specified such that Ai > A2 > A3 > A4. 
From the density matrix p = (1 + p k L k )/A we obtain 

w a (p k , 9 k) = [U( 9 k)p(pk)UHg k )] aa . (91) 

This construction implies ^2 t {p s )t — 1 such that Sp e has 
to obey 

E(^e)r = 0. (92) 
r 

A classical realization of quantum observables A(e k ) can 
be implemented by 

{A{e k )) T = i a{gk=ek) . (93) 
Here -f a ( gk =e k ) is defined by A = e k Lk,e k ek = 1, and 

la =[U(g k )Au\g k )] aa . (94) 

In other words, the observable A(e k ) again "reads out" 
from the sequence {a(gk)} the element for g k = e k . Then 
A(ek) = G(gk), and therefore U{gk)AU' { (g k ) is diagonal, 
such that 7 a equals the eigenvalue X a (g k ) of G in eq. ([50]). 
In all classical states r the observable A(ek) has a fixed 
value which is one of the eigenvalues of the associated op- 
erator A. 

The sequence {a(g k )} may be constructed by adding 
consecutively different "angles" on S 15 . For illustration we 
may consider a reduced system where only three different 



angles (g k m ) are included, such that a m — a(g k m ) and 
the 4 3 classical states r obey 

t = (ai,a 2 ,a 3 ) , (p s ) T = w ai w a2 w a3 . (95) 

In this system we may define three classical quantum ob- 
servables 

AW=A(e k = g<T ) ), 4 m) =* ( Z\ (96) 

with A*- m ) an eigenvalue of G^ m ' > — g k m ^ Lk- 

The two level observables obtain as a special case for 
those e k for which 7 a ( ek )=i = J a (e k )=2 = 1 , 1a(e k )=3 = 
7a( ei!l )=4 — ~1- The associated g k are those for which 
G = diag(l, 1, — 1, —1) = L\. We infer that the additional 
index ( in eq. (f8"Tj) corresponds to the sequence {a(gk)} 
for those g k for which dki m gkgi 7^ 0. In other words, we 
can write the sequence {a(g k )} = {l(gk),ct(gk)}- Here 
we observe that those gk for which the spectrum of G(g k ) 
consists only of ±1 need only two classical possibilities 
a(g k ) = 1 , ct(g k ) = 2 corresponding to 7 = 1 and 7 = — 1. 
(Similarly, if the spectrum of G(g k ) has only three distinct 
values, three values for a(g k ) are sufficient.) 

For the part of the classical probability distribution char- 
acterizing the environment we impose in addition to eq. 
(f9"2")l the conditions 

EW r (Wr = (97) 

T 

for p = 1, 2, 3. We can then evaluate the expectation values 
((A(e k )j p ) = E (Mek)) P T (Ps)r = E(7afu; Q (p fe ,e fc ), 

r a 

(98) 

where (71,72,73,74) is the ordered spectrum of eigenval- 
ues of A = GkLk- The last expression in eq. (|9"5]l can be 
interpreted as 

((A(ek)) P )=tr(pAn (99) 

in a basis where A is diagonal with ordered eigenvalues. 
The trace is invariant under a change of basis by unitary 
transformations and we have therefore realized the quan- 
tum rule for the computation of expectation values of pow- 
ers of A{e k ). In particular, one finds 

(A(e k )) = tx(pA(e k )) = Pk e k . (100) 

The discussion shows that (A(e k )j describes a classical 
quantum observable. It can be mapped to a probabilistic 
quantum observable with spectrum {'f a (sk)} and associ- 
ated probabilities w a {pk, e k ). 

We emphasize that the four conditions (I92p (|97p for p = 
0,1,2,3 are actually sufficient in order to ensure eqs. (f9"5|) 
(|99f for arbitrary p. The reason is that [A(e k )) can take 
at most four different classical value y a (ek)- We can define 
the probabilities of their occurrence w a by the sum of the 
classical probabilities for all states t for which A T — 7 a (efc) 

W a (pk,e k ) =J2Pr\A T =~ (a (e k )- (101) 
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Eq. (|98|) for p = 1,2,3 fixes w a (pk,ek) uniquely for any 
given p k and e/.. Then arbitrary functions /((A(efe)J must 
have expectation values given by 

(f(A(e k )))=tv(pf(A(e k )). (102) 

Equivalence classes of probabilistic observables 

Two classical observables A T , B T are equivalent from 
the point of view of the subsystem if they have even for- 
mally the same spectrum {7 a } and if for all allowed classi- 
cal probability distributions {p T } the probabilities w a are 
equal. This defines equivalence classes associated to prob- 
abilistic observables which are characterized by four pairs 
(ja,w a (pk)) ■ For two equivalent observables the expecta- 
tion values of arbitrary functions f(A), f(B) coincide 

(f(A)) = (f(B)). (103) 

If the allowed {p T } realize all possible (pf.) with pkPk < 3, 
and if A and B are quantum observables, the associated 
operators have to coincide, A = B. Nevertheless, the clas- 
sical observables A T , B T are distinct if 

({A- B) 2 ) = Y^Ar-BrYpr 

T 

= 2{A 2 } -2Y, A rB T p T > (104) 

T 

for some allowed probability distribution {p T }- In this re- 
spect the crucial observation is that in general the classical 
correlation 

A t B tPt = (A ■ B) (105) 

r 

depends on the environment (5p e ) T . It is often smaller then 
(A 2 ) such that A T and B T are distinct classical observables. 

As an example, we consider two "diagonal quantum ob- 
servables" B and C such that A — /(C) is equivalent to B 
for some appropriate function /. The spectra of B and C, 

, / (B) (B) (B) (B)x , / (C) (C) (C) (C)x 

namely (7} >,j y 3 >,Y 4 >) and (7} ',7^ >,j 3 >,j\ 

( q\ (C) 

obey both eq. (|5o) and are related by 7^ = /(7a )• The 

(B) (C} 

associated e k ,e k vanish for k > 4 and obey 

(B,C) 1/ (B,C) . (S,C)n 

el = 2IT1 +^2 ) i 

(B,C) 1/ (B,C) . (B,C)s 

4 = gVW +73 )) 

(B.C) 1/ (B,C) . (S,C)n 

e 3 -(7} ' ; +74 )• ( 106 ) 

They are distinct if the spectra of B and C differ. The 
observables B and C therefore read out different general- 
ized classical bits as — •^(e^ 5 '') and ac = Q! (e[, C '' > ) from 
the sequence {a(gk)}- The classical correlation function 
receives contribution from p s and Sp e , 

(f(C) ■ B) = (/(C) • B). + (/(CO • B) env , (107) 



with 

(f(P)-B)„ 

= EE^ 4 S Va c (Pfe, ef ) )7^ ) /(7 Q < c ) ) 
= (B)(/(C7)). (108) 
The contribution form the environment 

(/(C) • B) e ™ = ^(<5p e ) r S T /(C T ) (109) 

T 

is not fixed by the conditions (|97p for J 4(e[ B ' > ) and A(e k C ^). 
Its value depends on the detailed choice of {(5p e ) r }. As a 
particular environment we may take (Sp e ) T = for all r, 
which trivially obeys eq. (|9T|) and implies (f(C)-B) env = 0. 
It is now easy to find allowed probability distributions for 
which 

((f(C)~B) 2 )=2(Bi)-2(B)(f(Q) (110) 
differs from zero. For example, this will be the case if 

( B) 

(B) = ei pk — 0. We conclude that the classical observ- 
ables represented by B T and f(C T ) are distinct. As we have 
mentioned already, the concept of equivalence classes is 
crucial in order to avoid conflicts with the Kochen-Specker 
theorem 0, Q . Contradiction would arise if for two com- 
muting operators B and C, with B = /(C), the corre- 
sponding classical observables B and /(C) would always 
be identical. 

The size of the equivalence classes, or the ensemble of 
those classical observables which are mapped to a given 
probabilistic observable, depends on the notion of allowed 
probability distributions {p T }- For the split (fB"9")) and as- 
suming that all pk with pkPk < 3 are allowed, this de- 
pends on the allowed distributions |(<5p e ) r } characterizing 
the environment. For a fixed |(<5p e ) r }, as for example 
(Sp e ) T = 0, the equivalence classes are very large. For 
an observable with a given spectrum of at most four dif- 
ferent values 7q only three conditions of the type (jlOip 
must be met. This can be achieved by a huge number of 
possible choices for values A T among the four values 7 a . 
Even though these conditions have to hold for arbitrary 
values of pk, there are many different possible choices of 
A T - in general not of the simple form (|93|) - which can 
obey these conditions. If we take a different fixed choice 
of {(5p e ) r } the equivalence class corresponding to a given 
probabilistic observable will contain different classical ob- 
servables as for the case (Sp e ) T — 0. If we admit both 
|(<5p e ) r } for the allowed probability distributions, only the 
classical observables which fulfill eq. (|101[) for both choices 
of (Sp e ) T belong to the equivalence class. Observables for 
which (A p ) differs for the two different {{5p e ) T } are not 
system observables. As we increase the number of allowed 
{(<5p e ) r } the ensemble of system observables shrinks, and 
so do the equivalence classes associated to a given proba- 
bilistic observable. If arbitrary {{8p e ) T } are admitted each 
equivalence class would contain only one classical observ- 
able. For |((5p e ) T } obeying the constraints (j97|) we have 
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shown by explicit construction that the equivalence classes 
contain distinct classical observables. 

We end this section by noting that the construction of 
a classical ensemble representing four-state quantum me- 
chanics can be generalized in a straightforward way to M- 
state quantum mechanics with arbitrary M. Quantum me- 
chanics for observables with a continuous spectrum can be 
obtained for appropriate limits M — > oo. On the other 
hand, if one is not aiming for a description of all possible 
quantum observables, as for the classical statistical imple- 
mentation of a quantum computer in sect. [Til the space of 
classical states r can be reduced by restriction to a subset 
of g k - 



V. CORRELATIONS 

Beyond a rule for the computation of expectation val- 
ues of observables, any theory must provide a prediction 
for the outcome of two consecutive measurements. After a 
first measurement of the observable A the result of a subse- 
quent measurement of another observable B is, in general, 
influenced by the first measurement. In a statistical system 
two measurements are typically correlated and one has to 
specify the "measurement correlation" . It is, a priori, not 
always obvious which correlation should be chosen, since 
the measurement of A may have changed the ensemble or 
the knowledge of the observer. 

Conditional probability 

For simplicity we concentrate in this section on two-level- 
observables, (A 2 ) = (B 2 ) = 1, A 2 = B 2 = 1. The proba- 
bility of finding B — 1 after a measurement A = 1 amounts 
to the conditional probability (ur?)4. There are, in princi- 
ple, different ways to specify the conditional probability. A 
valid definition should be appropriate for the properties of 
a given measurement. For a "good measurement" we know 
that after the measurement A — 1 the ensemble must be 
an eigenstate to the eigenvalue 7^ = 1 - otherwise a sub- 
sequent measurement of A would not necessarily yield the 
same value as the first one. Then B is measured under this 
condition. 

We take here the attitude that there is only one given re- 
ality, but physicists can at best give a statistical description 
of it. The "fundamental laws" are genuinely of a statisti- 
cal nature [T3| and only establish relations within different 
possibilities for the history of the real world. Measuring 
for an observable A in a given state the value j a simply 
eliminates the other possible alternatives (which may have 
nonvanishing probabilities w a ^a)- After the measurement 
of A it makes only sense to ask what are the outcomes of 
other measurements under the condition that A has been 
measured to have the value 7 a . 

On the level of the classical statistical system with in- 
finitely many degrees of freedom, which describes the sys- 
tem and its environment, the elimination of the possible 
histories which are not compatible with the first measure- 
ment of A is not unique. Many different classical probabil- 
ity distributions p T can be eigenstates of A with 7^ = 1. 



One will have to specify how this elimination is done. In- 
deed, for a given classical representation A T one may "elim- 
inate" after the first measurement all states r for which 
A T = —1. Setting p T = if A T = — 1 leaves an eigen- 
state with (A) = 1, independently of how the probabilities 
for the states with A T — 1 are distributed after the first 
measurement. One possibility would be to keep the rela- 
tive probabilities of all states r for which A T = 1 the same 
as before the measurement. However, one could apply the 
same procedure to a second representation A' T , where now 
p T = for all r with A' T = — 1. If one keeps again the 
relative probabilities of the states with A' r = 1, the re- 
sults would differ from applying this prescription to A T . 
With such a prescription the state of the ensemble after 
the measurement A = 1 would therefore depend on the 
precise choice of A T or A' T , whose difference concerns only 
properties of the environment. Such a prescription (which 
is actually behind the use of the classical correlation for 
pairs of measurements) can only make sense if the mea- 
surement can resolve the details of the environment. It is 
clearly inappropriate for a measurement of the subsystem 
whose outcome does not depend on the environment. 

Measurement correlation 

The measurement correlation (BA) m describes the out- 
come of measurements of pairs of two observables A and 
B. As a criterion for a measurement that preserves the 
isolation of the subsystem and only measures its proper- 
ties we postulate that it should be possible to determine 
the measurement correlation (BA) m by using only infor- 
mation which is available for the subsystem. It must be 
possible to compute (BA) rn from the pk characterizing the 
original state of the subsystem before the measurement of 
A. No information about the details of the environment 
should be needed. 

The measurement correlation or conditional correlation 
(BA) m multiplies the measured values of A and B, weighed 
with the probabilities that they occur 

(BA) m = «)^< s -(^)^< s 

-«)^< s + (^< s . (HI) 

Here w± s denotes the probability that A is measured as 
±1 in the state s, with (A) = w+ s — s — tr (pA), 
w+ s + g = 1. The conditional probabilities must obey 
(w!l)± + (w?.)± = 1. The conditional correlation needs 
a specification of the conditional probabilities as (w±)+. 
For their computation we use the prescription that after 
the first measurement A = 1 the density matrix pa+ must 
describe an eigenstate of A, tr(ApA+) = 1. This effect of 
a first measurement may be called state reduction. The 
subsequent measurement of B then involves this state, 

(w*)$ - (w B )i = tr{B PA+ ). (112) 

The relation (|112[) is based on the property that the 
quantum observable B obeys eqs. (pf6"|). (1551) for an ar- 
bitrary quantum state of the subsystem. Our assumption 
is therefore that after the measurement of A the classical 
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ensemble still describes a quantum system. This seems rea- 
sonable for appropriate measurements since otherwise the 
first measurement destroys the isolation of the subsystem 
instead of only changing its state. This assumption has 
far reaching consequences, however. It necessarily implies 
eq. (|112p and excludes the option of using the classical 
correlation for the general description of subsequent mea- 
surements, as we will see below. 

Quantum correlation 

For M = 2 the matrix pa+ is unique, pa+ = h(l + A), 
such that 

= \± \tx{BA) , (w^ = \t jtr(Sl). (113) 

However, for M > 2 one has tv{ApA+) = l,txpA+ = 1 for 

PA+ = ^(l + A + X), tr(iX) = 0, 

trX = 0, (114) 

where the purity after the measurement obeys 

P = Mtv{p\ + ) = 1 + ^trX 2 . (115) 

With 

p\ + - PA+ = J^( X * + {4 X )) -^-J}) PA + ( 116 ) 

a necessary condition for pa+ describing a pure state is 
P = M - 1, trX 2 = M (M - 2), which implies X = only 
for M = 2. 

We may distinguish between a "maximally destructive 
measurement" where all information about the original 
ensemble except for the value of A is lost, and a "mini- 
mally destructive measurement" for which an original pure 
state remains a pure state after the measurement. A max- 
imally destructive measurement is described by X = in 
eq. (|1 14[) , leading to 

(B) A+ = (wf)i-(w B )i = ^tr(M) = (BA) max , (117) 

Here we denote by (B^4) max the conditional correlation for 
maximally destructive measurements and use that pa- ob- 
tains from pa+ by changing the sign of A in eq. Q114[) 
(with X — 0). We can use {BA) max for the definition of 
a scalar product between the observables B and A, since 
it does not depend on the initial ensemble. The two-level 
observables A^ k ' form an orthogonal basis in this sense, 

A minimally destructive measurement of A = 1 projects 
out all states with A = — 1, without further changes of the 
original ensemble and associated density matrix p, 

PA+= 2(1+ 1 + + A). (118) 

Similarly, a first measurement A = — 1 maps p on pa-i 
which obtains from eq. (| 1 1 8[) by changing on the r.h.s. all 



+ signs to — signs. We note that X in eq. (| 1 14|) depends on 
p. Due to the normalization factor and since (A) depends 
on p, the map p — > pa + is not linear in p. If p is the density 
matrix corresponding to a pure state tp, this also holds for 
i>A+, with 

^ A+ = [2(l + (A))]- 1/2 (l + i)^. (119) 

For M = 2 eq. dE) yields p A + = |(1 In sect. |IX] 

we will discuss the map p — > in more detail when we 
address the issue of sequences of measurements. 
With 

(BA) rn = tr(% + )tcf )S - tr(Bp A _)tfld, a , (120) 

and 

< a = (l±(A»/2 (121) 

our prescription for pa + yields for the measurement corre- 
lation 

(HA) ro = itr({i,S}p). (122) 

Thus the conditional correlation for minimally destructive 
measurements in the classical statistical ensemble corre- 
sponds precisely to the expression of this correlation in 
quantum mechanics. It involves the anticommutator and 
is therefore related to the quantum mechanical operator 
product. On the level of probabilistic observables we can 
express the conditional correlation (BA) m in terms of the 
expectation value of the quantum product (BA) S 

(BA) m = ((BA) S ), (123) 

demonstrating the close connection between the quantum 
product and the conditional correlation. 

The two point correlation is commutative, (BA) m = 
(AB) m , We will postulate that the two point correlation 
(|123p describes in general the correlation between two mea- 
surements for quantum systems and call it "quantum cor- 
relation" . We have motivated its use by two subsequent 
measurements, but the order of the measurements does ac- 
tually not matter. It seems therefore natural to use this 
correlation for any measurement of pairs of observables, 
independently of the time order. 

Classical correlation 

At first sight, a possible alternative choice may be the 
"classical correlation" which is based on the classical prod- 
uct A ■ B, as defined on the level of the classical ensemble, 
(A ■ B) T = A T B T . We will see in sect. IVIII however, that 
A ■ B is usually not a quantum observable and can there- 
fore not be determined from the information characterizing 
a quantum state, i.e. from {pk}- Using the classical cor- 
relation (B ■ A) — J2tPt(B ■ A) T — ^Y^tVtBtAt would 
therefore need information which relates to the environ- 
ment, but not only to the subsystem. In other words, the 
use of the classical product corresponds to a state reduction 
after the first measurement where substantial information 



18 



about the relation between the subsystem and the envi- 
ronment is retained. This is not what a good measurement 
in an isolated subsystem does. The classical correlation 
can therefore not serve for the description of such mea- 
surements. For any measurement where the outcome (in- 
cluding the state reduction) can be expressed in terms of 
information available for the subsystem, the choice of the 
quantum product seems natural. It retains a maximum of 
the information which is available in the subsystem. 

Since the correct choice of the correlation for a descrip- 
tion of two measurements is crucial we may describe the 
issue in some more detail. In any statistical setting one 
should distinguish the probability w++ that the two level 
observables A and B are measured with values A = 1, B = 
1 from the probability p++ that they "have" the values 
A = B = 1 in the classical statistical ensemble before the 
first measurement. On the classical statistical level one can 
express 

P++ = \(1 + (A) + (B) + (A-B)) (124) 

in terms of the classical correlation 

(A-B) =J2p T A T B T . (125) 

T 

The probability p++ does not specify a priori the condi- 
tional information relating two subsequent measurements, 
which is necessary for w ++ . In general, one needs a sep- 
arate prescription how u>++ should be computed from the 
available statistical information. Only under particular cir- 
cumstances, one may be able to identify u>++ with p++. 
In other words, w++ = p++ is an additional basic as- 
sumption which does not hold true in general. This con- 
trasts to the case of a single measurement for A, where 
w + = p + = (1 + (A))/2 by definition. 

If we try the identification u>++ = p++ for measurements 
in an isolated subsystem we run into severe problems. In 
general, the classical correlation (A ■ B) is not computable 
in the subsystem. It is a property of the system and its 
environment and cannot be obtained from the information 
characterizing the quantum state, i.e. from pk- This prob- 
lem is closely linked to the fact that the mapping from the 
classical observables described by A T to the probabilistic 

(A) (A) (A) 

observables described by 7a = ±1 and u>a = w± is not 
invertible. The classical observable A T describes properties 
of the subsystem and its environment, while the character- 
ization of the environment is only lost on the level of the 
probabilistic observable A. The use w++ = p++ corre- 
sponds to an implicit definition of the conditional proba- 
bility for two measurements where after the first measure- 
ment A = 1 all classical states r for which A T = — 1 are 
eliminated without changing the relative probabilities of 
those states where A T = 1. This elimination process de- 
pends, however, on the particular observable A T and there- 
fore also reflects properties of the environment, not only of 
the subsystem. Two observables A T and A' T , which lead to 
the same probabilistic observable A, produce, in general, 
different classical products (A-B) ^ (A' ■ B) . (These issues 
are discussed in more detail in sect. I VIII ) 



Choice of measurement correlation 

If the state of an isolated subsystem can be described by 
{pk}, this information must also be sufficient for a predic- 
tion of the outcome of two measurements. The probability 
w++ must be computable in terms of {pk}- For this reason 
we employ the quantum correlation (AB) m (|122[) . (|123|) 
and postulate 

w++ = \(l + (A) + (B) + (AB) m ), (126) 

as advocated already before for subsequent measurements. 
(A different motivation for the use of the quantum corre- 
lation is given in (l3j.) Indeed, now w++ can be expressed 
in terms of pk 

- ++ = \ (i + 4 A) 4 S) + p*\A A) + 4 fl) + d mlk e^4 B) i) 

(127) 

The other probabilities etc. obtain from eqs. (|126p . 

(I127P by appropriate changes of relative signs. 

The prescription for the probabilities of the outcome 
of two measurements influences strongly the statistical 
properties of correlations. For example, one may ask if 
a "hidden variable theory" is possible, where there ex- 
ist discrete functions A(v) = ±1 , B(v) — ±1 such that 
(AB) m = / dvp(v)A(v)B(v) with some probability distri- 
bution p(v). Just as in quantum mechanics, this can be 
excluded by the use of Bell's inequalities @, [HI- In our 
classical statistical setting the correlation function (|122[) , or 
the probabilities for the outcome of two measurements, are 
exactly the same as in quantum mechanics. On the other 
hand, Bell's inequalities apply to the classical correlation 
(A ■ B). Besides theoretical arguments we have therefore 
also experimental evidence that in general the classical cor- 
relation function should not be used for the description of 
the outcome of two measurements. 

Complete sets of measurements 

Consider the special case of two two-level-observables A 
and B for which the associated quantum operators A and 
B commute, 

[A,B]=0. (128) 

After a sequence of two measurements, where first A is 
measured to have the value ja = ±1, and subsequently B 
is measured to have the value jb = ±1, the state of the 
system is given by 

pBA, JB y A = ^-{1 + 1B B){1 + 1A A)p{l + 1A A)(\ + 1B B), 

(129) 

where the normalization factor Af assures t^PBA,-i B -i A = 1- 
Due to the vanishing commutator (|128|) the order of the 
measurements of A and B does not matter and we may 
say that A and B are measured "simultaneously" . The 
state of the system after the simultaneous measurements 
of A and B depends only on the initial state and the "out- 
come of the measurement", i.e. the four possible values 
(ia,1b) = (+,+), (+,-), (-,+), (-,-)■ We can consider 
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the combined measurements of A and B as a single mea- 
surement. 

The state after the combined measurement can be writ- 
ten in terms of the four projectors 

P 1B1A = \(1+1bB)(1 + 1a A) (130) 

as 

PlBlA — ^^IbIaP^IbIa- (131) 

We observe that p lBlA is automatically an eigenstate of 
any quantum observable D for which the associated oper- 
ator obeys D = AB (and therefore also commutes with A 
and B). The associated eigenvalue is given by the prod- 
uct 7d = • 7s- A simultaneous measurement of D, A, B 
is therefore possible, but the measurement of D does not 
yield any new information about the system. For four-state 
quantum systems (M = 4) the simultaneous measurement 
of two "commuting two-level observables" constitutes a 
"complete set of measurements" . After the measurement of 
the complete set, the system is in a pure state. (We assume 
here that J\f is finite and postpone a more detailed discus- 
sion of the limiting case AT —> oo to sect. IIXI ) The same 
must actually happen after the measurement of a quantum 
observable with a non-degenerate spectrum of four differ- 
ent eigenvalues. After the measurement of a given eigen- 
value the system is projected to the corresponding eigen- 
state which yields in this case a unique pure state density 
matrix. There is a close connection between measurements 
of non-degenerate quantum observables (and also general 
quantum observables) and complete sets of measurements 
for two-level quantum observables. We will not discuss this 
in detail in this paper. 

It is straightforward to generalize these concepts to gen- 
eral M-state quantum systems. A complete set of mea- 
surements of two-level observables projects the state after 
the measurement to a pure state density matrix. We will 
assume a minimal set in the sense that "redundant observ- 
ables" , whose values can be predicted uniquely after mea- 
surements of the minimal set, are removed from the set. 
(They are the analogue of the observable D in the preced- 
ing paragraph.) Instead of a set of commuting two- level ob- 
servables one may also use "commuting observables" with 
a spectrum of more than two distinct eigenvalues. After a 
complete set of measurements the system is in a pure state 
which is a simultaneous eigenstate of a "maximal set of 
commuting operators" , as familiar from quantum mechan- 
ics. 



VI. QUANTUM TIME EVOLUTION 

We have seen how quantum structures can arise from the 
description of subsystems where the "state of the system" 
is described by n expectation values of "basis observables" . 
For P < n the appearance of "non-commuting structures" 
is mandatory. The question remains why such quantum 
systems are omnipresent in nature, in contrast to "com- 
muting structures" for P = n. The answer may be rooted 



in stability properties of the time evolution. We discuss in 
this section the emergence and particularities of the unitary 
time evolution which is characteristic for quantum mechan- 
ics. 

Time evolution of the subsystem 

Let us consider some continuous time evolution of the 
classical probability distribution {p T }- It relates the en- 
semble at time ti to the ensemble at some earlier time ti, 
and induces a transition from pk(t±) to pkitz), 

Prih) = S T p(t2,h)p p {ti) , pk(h) = S k l(t2,ti)pl(ti). 

(132) 

We may decompose the transition matrix Ski into the prod- 
uct of an orthogonal matrix Ski , which preserves the length 
of the vector (pi . . . , p n ) and therefore the purity, and a 
scaling d, Ski — Skid. For an infinitesimal evolution step 
this implies 

d t Pk(t) = Tkipi(t) + D P k(t) , D = dtlndfah), 

Tki = —Tik = dtSkm{t,ti)Si m (t,ti). (133) 

For a given maximal purity during the evolution, eq. 
(I133P can be rewritten as an equation for the density matrix 
P, 

dtPaP = - l[H, p] a p + Rap-fS ^/0 7 «5 ~ ^7^7^ 

+ D( PaP - jjS a(3 ). (134) 

This corresponds to a split of the infinitesimal SO(n) trans- 
formation Spk — Tkipi into a unitary part corresponding to 
the subgroup SU(M) and represented by the hermitean 
Hamiltonian H = HkL-k + Hq, and remaining rotations of 
SO(n)/SU(M) represented by R or f M , 

Tm = —2fki m Hm+Tki 7 [Lk, Li] = 2ifki m L m , 

Tm — -^RaM8{Lk)fi a {Li) lS . (135) 

In general, H, R and D may depend on pk- 

Unitary time evolution 

We are interested in possible partial fixed points of the 
evolution for which R — and D — 0, while H is in- 
dependent of pk- (Partial fixed points of this type have 
been found explicitly in the classical time evolution of non- 
relativistic boson fields 0.) Then cq. ([134]) reduces 
to the linear von-Neumann equation for the density ma- 
trix. In case of a pure state density matrix this implies the 
Schrodingcr equation idttp — Hip. One recovers the uni- 
tary time evolution of quantum mechanics. The evolutions 
with R = D = are singled out by the property that a 
pure state of the subsystem remains a pure state during 
the evolution. It will be interesting to find out how this 
property is related precisely to the notion of the isolation 
of the subsystem. 

The more general evolution equation away from the "uni- 
tary partial fixed point" can describe "decoherence" [l6[ as 
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a decrease of purity for D < 0, or "syncoherence" as the ap- 
proach to the pure state partial fixed point with increasing 
purity for D > 0. The latter typically accounts for a situa- 
tion where the subsystem described by the observables 
can exchange energy with the environment. An example 
is the evolution from a mixed state of an atom in different 
energy states to a pure state of an atom in the ground state 
by virtue of radiative decay of the excited states. A static 
pure state density matrix obtains as usual as a solution of 
the quantum mechanical eigenvalue problem Hip = Ej?p. 
We suggest that the omnipresence of quantum systems in 
nature is due to the existence of such partial fixed points 
which reflect the isolation of the subsystem. 

The change of the purity is related to D in cq. (I133|) . 

d t P = 2 Pk d tPk = 2DP. (136) 

For D < the purity decreases - this describes decoher- 
ence. Decoherence is not time reversible - an arrow of time 
is singled out by the "direction" of the approach to equipar- 
tition. This also holds for the opposite process of an in- 
crease of purity, i.e. syncoherence. If the time evolution of 
the subsystem is time- reversal invariant, D must vanish. 

The remaining rotations described by Tm in eq. (|133|) 
could, in principle, be equivalent in both time directions. 
The unitary subgroup corresponding to T]y — in eq. (|135[) 
may be singled out by the observation that only such an 
evolution is compatible with the "principle of equivalent 
state and observable transformations" (PESOT), which 
states that instead of a time evolution of the probability 
distribution (or the state of the subsystem) one may equiv- 
alently describe the time evolution by time dependent ob- 
servables 0]. Only unitary transformations preserve the 
spectrum of quantum observables. Then PESOT corre- 
sponds to the well known equivalence of the Schrodinger 
and Heisenberg pictures in quantum mechanics. 

Hamilton operator 

If H is independent of p k it can be considered as an ob- 
servable of the subsystem, H = HkLk + Hq, with fixed 
coefficients Hk,Ho. By Noether's theorem it is associated 
with the energy of the subsystem, where Ej denotes the 
possible energy eigenvalues. (If one wants to use standard 
energy units one replaces H — ► H/K.) On the other hand, 
R and D account for the interactions of the subsystem 
with its environment. They vanish in the limit of "per- 
fect isolation" of the subsystem. If the interactions with 
the environment are strong enough, the subsystem is typ- 
ically not evolving towards the equipartition fixed point, 
Pap = jf^afS, but rather towards a Boltzmann type den- 
sity matrix p ~ exp[— (3{H + /U,iVj)] (for conserved quanti- 
ties Ni and chemical potentials pi), which is close to a pure 
state density matrix if the temperature T — is small 
as compared to the typical separation of the two lowest en- 
ergy eigenvalues Ej. In contrast, if the isolation from the 
environment becomes efficient fast enough, the subsystem 
follows subsequently its own unitary time evolution, as well 
known from quantum mechanics. 

Such a behavior would correspond to the approach to a 
partial fixed point at D = 0, R a f3- ( s — 0, as described in 
[J]. Consider the case where the lowest eigenvalue of H 



is not degenerate. Energy exchange with the environment 
would induce an increase of P until the maximal purity 
P = M — 1 is reached. For small enough T one ends in 
the unique ground state of the system - as characteristic 
for many atoms for the temperature of the earth. This 
scenario could provide a simple explanation why subsys- 
tems with the behavior of an isolated quantum system are 
omnipresent in nature. As well known from quantum me- 
chanics, the uniqueness of the lowest energy state explains 
that the isolated subsystems are all identical, i.e. the iden- 
tity of the atoms. 

At this point we may recapitulate what we have achieved. 
Starting form a classical statistical ensemble we have iden- 
tified a class of classical observables which have the same 
expectation values and admit the same algebra as the op- 
erators in a corresponding M-state quantum system. This 
holds provided the "purity constraint" P < M — 1 (to- 
gether with eqs. ([55]) ') is obeyed. Also the conditional 
correlations which describe measurements of two such ob- 
servables are the same as in the quantum system. Further- 
more, we have found the criteria for the time evolution of 
the classical probability distribution that ensure a unitary 
evolution of the density matrix for the corresponding quan- 
tum system. Obviously, these properties are related to the 
specific subset of classical observables that can describe the 
subsystem, in the sense that no further information about 
the environment is needed for a prediction of the outcome 
of measurements in the subsystem. In the remaining part 
of this paper we will discuss the properties of these specific 
quantum observables in some more detail. 



VII. PROPERTIES OF QUANTUM 
OBSERVABLES 

The quantum structures for probabilistic observables dis- 
cussed so far do not need any specification of the represen- 
tation as classical observables. Many classical systems with 
different states r, classical probabilities p T and classical 
values of the observable in a given state, A T , may describe 
the same state of the subsystem according to eq. (|45|) and 
provide for classical realizations of quantum observables. 
Nevertheless, the implementation of quantum observables 
implies certain restrictions on the possible classical realiza- 
tions. We will discuss those in the following, mainly for the 
purpose of conceptual foundations. One possible realiza- 
tion has been presented in sect. IIVI Here we will discuss 
the issue in a more general context. 

Classical and probabilistic quantum observables 

Classical observables are maps from the set of probability 
distributions fi, with elements {p T } — {pi ■ ■ ■ ,Ps), to real 

numbers, Q A -Z R, with {p T } -► (A^) = £ T p T Ai C) . (For 
simplicity we employ a language with a finite number S 
of classical states, which can be extended to an infinite 
set at the end in some specified limiting procedure.) We 
will restrict the discussion to those elements of ft which 
correspond to "quantum states" , i.e. which obey the bound 
for the purity of the ensemble. In general, the classical 
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observables AW) describe the system and its environment. 
(We use "system" for the (isolated) subsystem or quantum 
system from now on.) 

We are interested in the subclass of quantum observables 
AW) whose expectation values and quantum correlations 
can be computed in the system. (We often use in this sec- 
tion the upper index AW) or A^ in order to underline that 
we deal with observables on the classical level, as specified 
by A T , A T in the classical states r.) On the classical 
level, quantum observables are classical observables with 
special properties. First, for a Af -state quantum system, a 
quantum observable has at most M different classical val- 
ues At — la- (For a non-degenerate spectrum we can 
identify a = a = 1 . . . A'/.) For any given quantum observ- 
able we can classify the classical states r according to the 
value of A T ^ , r = (ja, c 7a ), where for each given j a one 
typically has a large degeneracy of classical states, labeled 
by a la . We can define the probability w a for the occurrence 
of a possible measurement value j a as 

«>.=$>('y«> 7.) . (A^)=Y / Iaw a . (137) 

As a crucial ingredient, w a must be computable from the 
quantities which specify the quantum state, i.e. from the 
expectation values of the "basis observables" (A^) = p/-. 
We assume a linear relation 

Wa(Pk) = 22c ak pk + CqO- (138) 

k 

The probabilities must be normalized, ^2 a w a (pk) = 1, for 
arbitrary pk, which implies the conditions 

E c <^ = ° > J]c a o = 1. (139) 

a a 

For quantum observables the coefficients c a fc,c a o are re- 
stricted further since w a {pk) must obey eq. (|66p. For non- 
degenerate eigenvalues one needs 

Cafe = j^(UL k U^) aa , c a0 = (140) 

for some suitable unitary matrix U. In this case the "quan- 
tum determination" of probabilities (|138|) amounts to a 
condition for the classical probabilities, namely that a uni- 
tary matrix U exists such that 

5>(7a,<0 - jj(l + J2Pki UL kU r ) aa )- (141) 

Thus quantum observables are defined by two properties: 
(i) the restriction of the spectrum to at most M different 
values, (ii) the "quantum determination" of probabilities 
w a (pk)- While (i) only involves a property of the classical 
observable, the second restriction (ii) depends on relations 
to the basis observables and on the selection of possible 
quantum states out of the most general probability distri- 
butions {p T }- For the specification of a quantum observable 
we need at least j a and c a k,c a o- A quantum observable 



AW) has the important property that its classical product 
AW) • AW) (defined by (AW) . #)) r = (A T Q) ) 2 ) is again 
a quantum observable, with spectrum (7^) and the same 
w a as for AW). This extends to higher polynomials and 
arbitrary functions /(AW)). 

We can now associate to any classical quantum observ- 
able AW) a probabilistic quantum observable, AW) — » A, 
which is characterized by the spectrum of possible mea- 
surement values (7 a ) and the associated probabilities w a . 
Only this information will be needed for a computation of 
expectation values ((AW))p) in a "quantum state" of the 
system, while the detailed form of {p T } is not relevant. On 
the level of classical observables the quantum observables 
are characterized by a distribution of values A T ^ = j a 
for the classical states r. This distribution still contains 
much more information than the spectrum "f a and the as- 
sociated probabilities w a . Therefore A T ^ still describes 
the system and partly the environment. Only on the level 
of probabilistic observables A the parts of A^' relevant 
for the environment are projected out, such that A only 
"measures" properties of the system. 

We also can associate to every A^ a quantum operator 
A by a map AW) — > A. It is constructed from eq. ([55]) by 
observing 

<^ (Q) > = E >(ca,o + E Ca ,fePfe) = 4 A) + E 4 A V 

a fc fc 

(142) 

This identifies 

(A) \ -> (A) \ ^ 

a a 

A = 4 A) + E e ^: (AW))=tr(Ap). (143) 
fc 

We note that the map (|143p is possible for arbitrary prob- 
abilistic system observables obeying eq. (|138[) . Without 
the restriction of the type (|140p for quantum observables, 
however, (AW)) 2 will, in general, not be mapped to A 2 . A 
simple example is the random two-level observable R, with 
7! = 1 , 72 = — 1 , c a k = , cio = c 20 = 1/2. It is mapped 
to A = 0, while R 2 = 1. 

In turn, we have a map from the space of quantum oper- 
ators O to the space of probabilistic quantum observables 
V, since for every operator A the spectrum {70} is defined, 
and the probabilities w a can be computed for all quantum 
states {pk} or density matrices p. The latter obtain from 
the diagonal elements (UpU') aa , with U the unitary ma- 
trix used for the diagonalization of A. The map from the 
space of classical quantum observables Q to the probabilis- 
tic quantum observables V is equivalent to the sequence of 
maps Q -► O, O -> V. 

However, the map from the ensemble of classical quan- 
tum observables Q to the space of quantum operators O 
is not invertible. The classical observables AW) involve a 
specification of A T ^ for every classical state r, which is 
much more information than contained in the coefficients 
c a ,fc j c a fi. We may encounter situations where a quantum 
observable 2?W) j s mapped to an operator B, while also a 
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function f(A^) of a different quantum observable 
is mapped to the same operator, f(A) — B. (Here f(A) 
is an operator valued function, while f(A^') is based on 
the classical product A^> ■ A^K) Such a situation does 
not imply an identification of the quantum observables at 
the classical level, i.e. in general one has B^ ^ f(A^). 
This lack of invertibility of the map Q — > O constitutes 
an important difference between our approach and many 
alternative attempts of a "classical formulation of quan- 
tum mechanics" , which associate to each A a unique clas- 
sical observable. For example, this is typically assumed for 
"hidden variable theories". Also for the Kochcn-Specker 
theorem Q the existence of a map A — > A^ is a crucial 
hypothesis, which is not obeyed in our setting. 

Algebra of observables 

On the level of classical observables we always can de- 
fine a linear combination, C — XaA^ + XbB^\ and the 
pointwise product, D — A^ ■ B^\ of two quantum ob- 
servables A^\B^\ where C T = X A A { T Q) + \ B B^ Q) , D T = 
At Bt ■ However, in general neither C nor D are quan- 
tum observables. Consider the simplest case, M = 2, 
and the two basis observables A^> and A^ with spec- 
trum jjp = IcP = ±1- A linear combination C = 
cos &AW+em<&A& has a spectrum 7„ = ± cos ■& ± sin d. 
This observable has four different possible measurement 
values. It can therefore not be a quantum observable of 
the system with M = 2, even though (C) can be com- 
puted in terms of pi t % = (AW>( 2 )). We conclude that 
the "rotated spin" , which corresponds to the operator 
A($) — cos ■& A^ + sin 'dA^, has to be described by a quan- 
tum observable A^ that is again a two level observable 
with spectrum 7 Q = ±1, rather than by a linear combina- 
tion of A^ and A^> of the type C. This necessity arises 
for each value of the angle i? and we have discussed in de- 
tail in Q that this needs a classical ensemble with infinitely 
many classical states r. The reader should note that one 
can define two types of linear combinations. On the level 
of classical observables one can define combinations of the 
type C, while on the level of operators or the associated 
probabilistic quantum observables a natural definition is 
A($) or A^ '('&). In general, a projection on the subsys- 
tem does not map C to A(d) or the probabilistic observable 
A(d). 

The classical product D — A^ ■ A^ has a spectrum 
= il- The condition (i) for a quantum observable 
is obeyed by D. However, the probability u4 for find- 
ing 7! = 1 needs knowledge of the joint probability to 
find A« = l,AW = 1 or A« = -1, A^ = -1. This 
information cannot be extracted from p\ and p2, which 
only yield the probabilities for finding A^> = ±1 (namely 

w±^ — (1 ± /Qi)/2) or for finding A^ = ±1 (namely 

(2) 

w± — (1 ± p2)/2). Nor is it contained in the expecta- 
tion value p3 of the third basis variable for M = 2. We 
conclude that the classical observable D does not obey the 
condition (ii) for a quantum observable. 

We conclude that different algebras can be formulated 
on the levels of classical observables and probabilistic ob- 



servables. The map Q — > V defines equivalence classes 
of classical quantum observables. The algebra of classical 
observables, as defined by D = A ■ B , C = A + B, is 
an algebra defined within the space of all classical observ- 
ables, but the operations of addition and multiplication do 
not remain within the restricted space of classical quantum 
observables Q. They can therefore not be transported to 
the space of probabilistic quantum observables V . On the 
other hand, we have seen in sect. IIIII that a new algebra 
can be defined, acting in the space of the probabilistic ob- 
servables V ', i.e. involving the product (AB) S . This new 
structure is closely related to the operator algebra in quan- 
tum mechanics. It is possible to transport the algebra to 
the space of classical quantum observable Q by selecting a 
fixed classical representative for each possible outcome of 
multiplications and additions. It is not clear, however, if 
this is useful and we will not need such a construction for 
our purposes. 

Representation of quantum observables as classical 
observables 

After these general remarks we now present an explicit 
classical ensemble and classical quantum observables for a 
system with given M. We recall that the classical quan- 
tum observables A^ which are mapped to a given A are 
not unique. Also the specification of the classical states r 
and the corresponding construction of classical observables 
is not supposed to be unique. At the end, all measurable 
information of the system can be expressed in terms of the 
expectation values of quantum operators, such that the de- 
tails of the classical observables do not matter. Only the 
existence of the classical observables in a setting free of 
contradictions is therefore needed in order to demonstrate 
a realization of quantum mechanics as a classical statisti- 
cal ensemble. We have already given such an example in 
sect. IIVI Here we discuss a classical realization closely re- 
lated to it which is, however, somewhat more general since 
restricted sets of quantum observables are covered as well. 

It is sufficient to determine at least one classical quan- 
tum observable for every operator A, i.e. for every her- 
mitean M x M matrix, which is needed for the descrip- 
tion of the system. The quantum observable XA^> , with 
A G R, is mapped to the operator \A. We will there- 
fore restrict our discussion to operators with unit norm, 
say tr A 2 — M. Also the addition of a part proportional 
to the unit observable translates for operators to the ad- 
dition of a corresponding piece proportional to the unit 
operator, A + c — > A + c. We can therefore restrict the 
discussion to traceless operators, trA = 0. We follow a 
simple construction principle. Consider first a single op- 
erator A with a spectrum of m(A) < M distinct eigenval- 
ues A q( ^(A). We associate to it m(A) discrete classical 

states, labeled by a(A) = 1 . . .m(A). In these states the 
classical observable A^ , which is mapped to A, takes the 
values — ^a(A) (A)- Add now a second operator B 

with m(B) distinct eigenvalues A a ^ (B). If this operator is 
"independent" we construct the direct product space with 
states r labeled by the double index r = (a(A), a(i?)) , and 
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= X a(A) (A), B^ Q) = X a{B) {B). This is continued until 
all independent operators are included. As stated above, 
the resulting ensemble has infinitely many classical states 
r, if the number of independent operators is infinite. (A 
well defined sequence of subsequently included operators 
induces a well defined limit process for the construction of 
the ensemble [4|.) Our construction yields explicitly a clas- 
sical quantum observable for every independent operator. 
We recall that many further classical observables that do 
not obey the restrictions for quantum observables can be 
defined in the ensemble. 

Details of the construction will depend on the notion 
of independent operators. As a simple criterion we may 
call two operators independent if tv(A — B) 2 > e, and 
take the limiting process e — > 0. Other more restrictive 
definitions of "independent" may be possible. Different 
contradiction-free definitions of "independent" lead to dif- 
ferent classical realizations, which all result in the same 
quantum properties of the system. The explicit construc- 
tion above has demonstrated that such classical realiza- 
tions exist. Of course, there are also classical ensembles 
with "many more" states than those used in our explicit 
construction. If one is only interested in the quantum ob- 
servables the states f of such a larger ensemble can be 
mapped to the states r of the ensemble used in the con- 
struction by summing the probabilities of all states which 
have the same A T for all quantum observables. 

For our discussion of a classical realization of a quantum 
computer in sect. [Tj] we have used this type of construc- 
tion for the states of the classical ensemble. However, we 
have taken there only the operators or which are 
associated to the basis observables A^ k \ as independent 
operators. This was sufficient for the purpose of resenting 
initialization, gate operations and readout of a quantum 
computer. If we want to describe further quantum ob- 
servables, as arbitrarily rotated spins, we have to include 
further labels for the classical states. Since infinitely many 
"rotation directions" exist, one needs an infinity of classical 
states [4(. This has been implemented in sect. IIVI 



sists of all linear combinations 



Classical sum and product for observables 

We next turn to the conditions under which linear com- 
binations or classical products of two classical quantum ob- 
servables can again be quantum observables. We will find 
that this requires that the associated quantum operators 
commute. For C (Q) = XaA^ + A B B (Q) the expectation 
value can always be expressed in terms of pk . We can there- 



fore compute e k C ^ 



A^e^ + Ase^, 5 '' such that (C) obeys 
eci.([46|). This allows for the construction of an operator C 
(1551) which obeys for all p 



(C) =tr(Cp) - \ A (A) +X B (B) 



tr 



(X A A + X B B)p . 

(144) 

We therefore can identify C = XaA + XbB. At this step, 
however, the spectrum of possible measurement values for 
C does not necessarily coincide with the spectrum of eigen- 
values of C, which is a necessary condition for a quantum 
observable. On the classical level the spectrum of C con- 
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(C) 



7(a,6) 



X A l [ a A) + X Bl l 



(B) 



(145) 



for all possible pairs (a, b) = c. It may be reduced by elimi- 

(c) 

nating those 7c for which the probability w c vanishes for 
all quantum states. Even if the number of different 

values 7c C ' ) obeys < M as realized, for example, if 

Pp) < M, there is no guarantee that all 7c C '' ) coin- 
cide with the eigenvalues of C. 

If is a quantum observable its spectrum must co- 
incide with the spectrum of C. Furthermore, the classical 
product -C^ must also be a quantum observable and 
obey 



(C-C) = X 2 A (A-A)+X 2 B (B-B)^ 
= tr(C 2 p) 

= X 2 A tr(A 2 p) + X 2 B tiiB 2 p) 



We find as a necessary condition that the classical product 
A ■ B must be computable in terms of pk 



2X A X B (A-B) 

(146) 

h X A X B tr( {A, B}p). 



(A-B) 



rtr 



({A,B}p 



(147) 



This has to hold for arbitrary p. If any (nontrivial) lin- 
ear combination of A^ and B^ is a quantum observable 
one concludes that ■ flW must be a quantum observ- 
able with associated operator A ■ B — ► \{A, B}. Similar 
restrictions arise for higher powers of C . 

The condition (|147|) is nontrivial. On the classical level 
we can derive from {p T } the probabilities W( a ,b) that A 

has the value and B takes the value . For 

quantum observables A^\B^ the probabilities 



J2a w (a,b) can be computed from {pk}- 



In general, the information contained in {pk} will not be 

however. It will therefore of- 



sufficient to determine W( a t), 

ten not be possible to express (A-B) = b^"* 1 ^ ' w (a,h) 



in terms of {pk}- Then D = A ■ B cannot be a quantum 
observable. On the other hand, if a linear relation be- 
tween (D) and pk exists, (D) 



(D) 



write (D) = tr(Dp) and eq. (TUT)) implies D = \{A, B}. 

If XaA^+X b B^ is a quantum observable for arbitrary 
XajXb, the associated operators A and B must commute, 
[A, B] = 0. In order to show this, we first consider the case 
where a given — XaJ^ + X B j^ corresponds to a 
unique combination (a, b). If is a quantum observ- 

able, there must exist probability distributions {p T } which 

( c) 

are an "eigenstate" for the "eigenvalue" 7g . This implies 
Wc = 1 , w c ^c = or Wfafy = 1 , I0( a ,6) = if a ^ a or 
b ^ b and therefore w a = 1 , w a ^ a = , log = 1 , w b ^i = 0. 
We conclude that this state is also a simultaneous eigen- 
state of the observables A^> and B^\ with respective 
eigenvalues and 7^ S '- In particular, we may consider 
a pure state ip s which is an eigenstate of C with eigenvalue 
7^ C) . It must obey Atp 5 = li^^c , ip^A = T^ipg , Bip s = 
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J^tps , ?PcB = 7g V?- We choose a basis where C is di- 
agonal and ip? = ipf = (1,0, . . .0). In this basis A and 
B must be block-diagonal, A a i — ji A ^S a i, A\p — 7g £1/3 

and similar for B. We can repeat this for other eigenvalues 

fen 

of C. If for every eigenvalue 7c of C the composition out 
of eigenvalues olA^Q' and B^ is unique, we can infer that 
A and B must commute. Indeed, in the basis where C is 
diagonal both A and B must be simultaneously diagonal, 
and therefore [A, B] = 0. 

(c) 

In presence of multiple possibilities of composing 7g ' 
from linear combinations of and the discussion 
is more involved. This case appears, however, only for 
particular coefficients Xa , Xb ■ If C is a quantum observ- 
able for arbitrary Xa and Xb such degenerate cases can be 
avoided such that A and B must commute. Indeed, con- 
sider the case of a "degenerate decomposition" for a par- 
ticular pair (Xa, Xb)- This occurs if there are two solutions 

\ (A) . \ (B) (C) , (A) . , (B) (C) 

X A r ai + X B % = 7ci , X A la 2 + X B % 2 = 7c 2 , with 
7 c f ■* = 7^ , 7oi 7^ ■ Performing an infinitesimal shift 
A ,4 — > A^i + (5a, while keeping Xb fixed, results in a separa- 
tion of 7(f) and 7ca C) , 7c2 C) - 7 <? } = * A ( 7 j£> - 7^) ^ 0. 
Then 7^ has a unique composition from 7^ and J^f* ■ 
(We have discussed here the case of two-fold degeneracy 
where other eigenvalues of C are separated from 7cf ' , 7I2 
by a finite distance. Higher degeneracies can be treated 
similarly.) 

Comeasurable quantum observables 

Two classical quantum observables A™ and B^> are 
called "comeasurable quantum observables" if arbitrary lin- 
ear combinations XaA^ + XbB^ are also quantum ob- 
servables. The operators A, B associated to a pair of 
comeasurable observables must commute, [A,B] — 0. Fur- 
thermore, the classical product of two comeasurable quan- 
tum observables is a quantum observable. The associated 
operators and probabilistic observables are given by the 
chain of maps 

^W) . B {Q) -> B} (AB) S . (148) 

For comeasurable quantum observables the classical corre- 
lation is computable from the quantum system and equals 
the quantum correlation. 

We next consider general conditions for the classical 
product being a quantum observable, — A^ ■ B™. 

If for all eigenvalues of flW the decompositions 7^ = 

7s 75 are unique, the operators A and B must again 
commute. In this situation the quantum state specifies the 
probabilities w a for the observable A to have the value ja , 
the analogue for u>b, and in addition the joint probability 
w c = ui{a,b) thai a measurement of A yields 7^ and a mea- 
surement B yields 7^ . Since the associated operators A 
and B commute we may choose a basis where both are di- 
agonal. The probability for the operator AB to take the 
value jc = 7 a 'Y^i with (a, b) — c, is given by the cor- 
responding diagonal element of the density matrix in this 



basis, (p') aa - This must equal w c , and we conclude that for 
all quantum states Ti(ABp) = J2 C w c j { c D) = (D) =Tr(Dp) 
and therefore eq. (|148[) applies. 

Inversely, we cannot infer that for every pair of quantum 
observables A^\ B^\ for which the associated operators 
commute, [A, B] = 0, the classical product A^> ■ 
must be a quantum observable. There is simply no guar- 
antee that the joint probabilities W( a ,b) nn d an expression 
in terms of {pk}- Furthermore, if A^ and are comea- 
surable quantum observables and A ■ B = D is therefore a 
quantum observable, and if F^' is a quantum observable 
with associated operator F — \{A : B}, this does not im- 
ply that (A ■ B) T equals F T . There is a whole equivalence 
class of distinct classical quantum observables which are 
mapped to the operator F, and it would not be clear with 
which one (A- B) T should be identified. We only know that 
(A ■ B) T belongs to the same equivalence class as F T . 

One may understand this issue in more detail in the real- 
ization of four-state quantum mechanics discussed in sect. 
IIV1 In general, even for two different e& for which the op- 
erators commute, as and in eq. (|106[) (with C 
replaced by A here), the classical correlation 

(A - B) — (A) (B) + J2(Spe)rA T B T (149) 

r 

will depend on the environment {(Sp e ) T }. We may, how- 
ever, restrict {(<5p e ) T } by imposing beyond the conditions 
(li?T)) the relations 

((A ■ B) p ) = tv{p(AB) p ) (150) 

for p = 1,2, 3. This enlarges the set of the possible system 
observables and the equivalence class corresponding to the 
operator F = \{A, B} = AB, such that A ■ B is now 
a quantum observable belonging to this equivalence class. 
The explicit construction of sect. II VI shows, however, that 
the classical observable F = A(e^), with F = e^Lk, is 
distinct from the observable A - B. 

It is important that A ■ B and F are in the same equiva- 
lence class, but not identical classical observables. In par- 
ticular, if there is another quantum observable G which is 
comeasurable with F, such that F ■ G is a quantum ob- 
servable, this does not imply that A ■ B ■ G is a quantum 
observable. The classical products of G with two different 
representatives of a given equivalence class may be differ- 
ent. This is important in order to avoid contradictions 
for an implementation of several pairs of comeasurable ob- 
servables. Assume for M = 4 the operator representations 
A —> L 3 , B -► L 2 , F -> Li , G — ► £4. While A ■ B 
can be a quantum observable in the same equivalence class 
as F, and F and G may be realized also as comeasurable 
observables, the classical product A B G is not necessarily 
a quantum observable. If it would be one, also A-G-B and 
G A B would be identical quantum observables since the 
classical product is commutative. On the other hand, the 
operators B and G or A and G do not commute. Such a 
situation can lead to contradictions as we will see when we 
next discuss the notion of bit chains. Those are avoided if 
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A ■ B ■ G is not a quantum observable. We also recall that 
it is only an option to realize pairs of quantum observables 
which are associated to commuting operators as comeasur- 
able observables. This requires additional constraints on 
|(<5p e ) r } which need not to be imposed. 

As an upshot of this discussion we conclude that the 
lack of a map O — ► Q leaves a lot of freedom in the choice 
and properties of the quantum observables on the classical 
level. Generically, linear combinations and classical prod- 
ucts of quantum observables are not quantum observables 
themselves. 

Bit chains 

Finally, we discuss the special setting of "bit chains" . Bit 
chains are sets of bits for which the probability for the out- 
come of ordered measurement sequences as (+,—,+) for 
(bit 1, bit 2, bit 3) can be predicted from the knowledge of 
the state of the subsystem. The simplest bit chain is a set 
of three comeasurable two level observables which we take 
among the set of basis observables A^ . For the example 
M = 4 we may consider the observables T\ , T%, T 3 associ- 
ated to the three commuting diagonal operators L\, L 2 , L 3 . 
A bit chain arises if the expectation values of two (or sev- 
eral) one-bit-observables as well as their (multiple) prod- 
ucts can be determined simultaneously in a quantum sys- 
tem. 

Suppose that the first bit corresponds to T\, the second 
to T2. Each bit can take the two values +1 or —1. We 
can consider measurements of the two bits and construct a 
composite observable which takes the value +1 if the signs 
of measurements of Ti and T 2 are equal, and —1 if they 
are opposite. If the information contained in the state of 
the system is sufficient in order to predict the outcome for 
the combined observable, such that its expectation value 
depends linearly on pk , we can conclude that the combined 
observable must be a quantum observable. The combined 
observable should be in the same equivalence class as T3, 
with corresponding quantum operators obeying L\L 2 = 
L 3 . 

From the expectation values p m = (T m ) , m = 1, 2, 3, we 
can determine the probabilities w 7 . e that a measurement 
of bit one finds the value 7 ± 1 and the measurement of bit 
2 yields e = ±1, namely 

w ie = jC 1 +7Pi + e P2 + 7 e P3)- (151) 

The order of the measurement does not matter here. A 
bit chain is closed in the sense that if T3 is considered as a 
bit and T2 as a second bit, the composite two level observ- 
able is now in the same equivalence class as T\ , according to 
L3L2 = L\. We can compute the probabilities for arbitrary 
sequences of measurements of T\ and T2 if we assume that 
the measurement correlations for the composite observable 
constructed from T\ and T2 are the same as for the bit T3 . 
For four-state quantum mechanics there are many differ- 
ent bit chains, associated to products of two commuting 
operators as L\L± = Lq. 

Comeasurable bit chains 

For a comeasurable bit chain the classical product T\ ■ 
T2 is also a quantum observable, in the same equivalence 



class as T 3 . From the expectation values (Ti) , (T 2 ) , (T 3 ) 
we can determine all probabilities p++ etc. for the four 
possibilities of values (+, +) , (+, — ) , (— , — ) and (— ,+) 
for bits one and two [3j . In summary, for T\ , T2 and T3 
to form a comeasurable bit chain we restrict the classical 
probability distribution p T such that the expectation values 
obey 

(Ti-T 2 ) = (T 3 ) , (Ti-T 3 ) = (T 2 ), 

(T 2 • T 3 ) = (Ti) , (Tx • T 2 • T 3 ) = 1. (152) 

This can be easily generalized: for a comeasurable bit 
chain of quantum observables with P members Tj , j = 
1 . . . P, all mutual classical products are members of the 
bit chain, such that Ti ■ Tj is in the equivalence class of 
CijkTk\(i ^ j), with ctjk = cjik = 1 for one particular 
combination k) and zero otherwise. All associated op- 
erators Tj mutually commute, and T{Tj = c^T^ , T? = 1. 
This extends in a straightforward way to classical products 
of an arbitrary number of members of the bit chain. One 
infers that all linear combinations \Ti + XjTj are quantum 
observables, represented by the operators A;Ti + XjTj. 

For given M the maximal number of members of a bit 
chain is P = M — 1 and we call such chains "complete bit 
chains" . This restriction follows simply from the maximal 
number of mutually commuting operators. The presence of 
a bound for P poses certain restrictions on the classical re- 
alizations of comeasurable bit chains associated to different 
sets of mutually commuting operators. 

As an example, consider the case M — 8. A possible 
complete three bit chain with seven members can be 
associated to the operators C\ — ► (r 3 ® 1 ® 1) , C2 — > 

(1®T 3 ®1) , C 3 -> (1<8>1<8>T 3 ) , Cl -> (1®T 3 ®T 3 ) , C 2 -> 

(t 3 ® 1 ® r 3 ) , C 3 -> (r 3 <g> r 3 <g> 1) , C -» (r 3 ® t 3 <g> r 3 ). 
For this "C-chain" one has for the operators (we 

omit hats here) CjCj — C = C\C%C 3 for all 
j = 1,2,3 , C1C2 — C3. Alternative candidates for 
complete three bit chains are the "A-chain" where T3 
is replaced by n, or the "B-chain" which obtains from 
the C-chain by the replacement r 3 — > r 2 , Further can- 
didates are the ".F-chain" (Ci, A 2l A 3 , Ti, F 2 , F 3 , F), 
"G-chain" {At, C 2 , A 3 , Gi, G 2 , G 3 , G) or "/f-chain" 

(Ai,A 2 , C 3 ,Hi,H2,H 3 ,H), with analogous multiplication 
structures given by the order of the elements in the list, i.e. 

F -> (rsOnOn) , G -> (ri(g)T 3 (g)Ti), H -> (n ®n ®r 3 ). 
Finally, we may consider a possible candidate "Q-chain" 

(F,G,H,Qi,Q 2 -Q3,Q), with Q -(t 3 0t 3 0r 3 ). 

If all these sets of observables are simultaneously realized 
as comeasurable bit chains, and if A ■ B = C , C ■ D = E 
would imply A- B - D = E, we would run into contradiction. 
Here we denote by A = B that A and B are in the same 
equivalence class, i.e. that the associated operators obey 

A = B. From the Q-chain we conclude F ■ G ■ H = Q- In 

turn, from the F, G, H chains we infer C\ ■ A2 ■ A 3 = F, 

At ■ C 2 ■ A 3 = G , A\ ■ A 2 ■ C 3 = H- If this would imply 
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Q = C 1 -A 2 -A 3 -A 1 -C 2 -A 3 -A 1 -A 2 -C 3 = C 1 -C 2 -C 3 = C, 

we would find that the operators associated to C and Q 
have opposite sign, showing the contradiction. 

This clearly demonstrates that not every set of two level 
observables for which the associated operators mutually 
commute can be a comeasurable bit chain simultaneously. 
In our case this only poses a consistency condition for the 
possibilities of classical products of quantum observables 
being quantum observables themselves. If we had a map 
A -► A^\ with f(A) -► /(AW) and f(A^) based on the 
classical product, one could show that for every pair of com- 
muting operators A, B the map implies AB — > A ■ B. The 
resulting contradiction is a proof of the Kochen-Specker- 
theorem [f| - actually the above chains of observables cor- 
respond precisely to the elegant proof of this theorem by 
N. Straumann [□]. 

The observation that not all "candidate chains" 
C, F, G, H, Q can be simultaneously comeasurable bit 
chains does not mean that the associated sets of seven 
commuting operators are inequivalent. There is no prob- 
lem to associate to each such operator set a comeasur- 
able bit chain. Only the bit chain associated to the set 

Qi = (73 ® n ® ti) , Q 2 = (ti ® r 3 <g> n) , Q 3 = (n ® n ® 
73) , Q2Q3 , Q3Q1 , Q1Q2 , Q1Q2Q3 = -(7-3 ® T-3 ® 7-3) 

should be a new comeasurable bit chain with quantum ob- 
servables {Qi,Q 2 ,Q 3 , Q 2 ■ Q 3 , Q3 ■ Qi , Qi ■ Q2 , Qi ■ 
Q2 • Q3) which are different from the candidate Q-chain 

(F, G, H, Qi, Q2, Q3, Q) discussed above. This does not 
lead to any contradiction, since the map from quantum 
observables to operators is not invertible. Both the ob- 
servables Q\ and F are mapped to the same operator Qi, 
but the classical product may be a quantum observable for 

Qi • Q2 and not for F ■ Q 2 - In our explicit construction 
of a classical representation of observables we should not 
exclude Q\ from the set of independent operators with the 
argument that it can be obtained as the product of two 
commuting observables. 

VIII. ENTANGLEMENT 

The violation of Bell's inequalities and entanglement are 
often considered as key features which distinguish quan- 
tum mechanics form classical statistical physics. In this 
section we demonstrate that the classical statistical en- 
sembles which correspond to quantum systems can realize 
entanglement. We show that the conditional correlations 
which are appropriate for measurements in the subsystem 
indeed violate Bell's inequalities. We argue that the key in- 
gredient for the violation of Bell's inequalities is statistical 
incompleteness. 

Classical statistical entanglement and violation of 
Bell's inequalities 

Entanglement is a key feature of quantum mechanics. 
Its classical realization is best discussed in the context of 
bit chains. Consider M = 4 and a bit chain of observ- 
ables Ti,T 2 ,T 3 , with corresponding commuting diagonal 



operators Li,L 2 ,L 3 — L\L 2 . We associate a first bit 
to T\ and a second bit to T 2 . The product of measure- 
ments of the two bits is then determined by p 3 = (T 3 ). 
Let us concentrate on a state with p 3 = (T 3 ) = — 1 , 
pi = (Ti) = , p 2 = (T 2 ) = 0. Depending on the 
other pk this may be a pure or mixed state, with purity 
P = 1 + J2k>4:Pk- From pi — we infer an equal prob- 
ability to find for the first bit the values +1 and —1, and 
similar for the second bit from p 2 — 0. On the other hand, 
p 3 = —1 implies a maximal anticorrelation between bits 
one and two. The probabilities vanish for all classical states 
for which both bit one and bit two have the same value, 
corresponding to w++ = w__ = , w |_ = w ^ = 1/2. 

We may assume that a first apparatus measures bit one, 
and a second one bit two. Whenever the first apparatus 
shows a positive result, the second apparatus will neces- 
sarily indicate a negative result, and vice versa. By itself, 
this anticorrelation does not yet indicate an entangled pure 
state. For example, it may be realized by a mixed state 
with pf. = for k > 4 , P = 1, corresponding to a diagonal 
density matrix p = (l/2)diag(0, 1,1,0). 

We may compute the quantum or conditional correlation 
(I120p , (|122|) for measurements of a rotated spin observable 
A(-d) with associated operator A($) = cosi9Li +sinz?Lg, 
together with a second rotated spin observable B(<p) with 
B(ip) = cosipL 2 + sinipLi. One finds for arbitrary pk 

(A(0)B(<p)) = C(^) = ~tr({i(tf),S(^)}p) 

= cos d cos ipp 3 + cos 1} sin ipp 6 (153) 
+ sin "d cos (pp\$ + sin d sin tpp 12 , 

where we recall the representations of the generators Lk 
(|28|) . For all states with p 3 = p\ 2 = — 1 , pe = pxo = 0, 
one obtains the familiar quantum result for two spins with 
relative rotation 

(A(&)B(tp)) = - cos(i9 -(p) = C($ - ip). (154) 

Bell's inequality for local deterministic theories reads for 
this situation 

|C(i?i,0) -C(i? 2 ,0)| <1 + C(0i,0 2 ). (155) 

Witheq. (HSU this reduces to | C{d 1 )-C(tf 2 ) | < l+C^i- 
d 2 ). It is violated for #1 = 7r/2 , d 2 = 7r/4. This clearly 
shows that we have introduced a conditional correlation 
(|120[) within a classical statistical setting which violates 
Bell's inequalities. 

We observe that the contribution ~ p\ 2 to the quantum 
correlation matters. For our choice $1 = 7r/2 , d 2 — ir/4 
the inequality ([155]) reads |C(tt/2,0) - C(tt/4,0)| < 1 + 
C(tt/2, 7t/4). For p 6 = Ps — , p 3 — —1 and general 
p 12 one finds C(ir/2,0) = , C(tt/4,0) = -l/y/2 and 
C(7r/2,7r/4) = pu/\/2. For p i2 = Bell's inequality is 
now obeyed. 

We conclude that the presence of off-diagonal elements 
in the density matrix (in a direct product basis for the 
two entangled spins) plays an important role for the co- 
existence of different complete bit chains. For our exam- 
ple with M — 4, a second bit chain besides Ti,T 2 ,T 3 is 
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given by Tg,T±, T±2, with associated commuting operators 
L 8 , L4, L12 — L S L4. Not only the spins in one direction are 
maximally anticorrelated for P3 = —1, but also the spins in 
an orthogonal direction (represented by Lg,Li) are maxi- 
mally anticorrelated for p±2 = — 1. The quantum state of 
the subsystem allows for a specification of several correla- 
tions by independent elements as P3 and P12 . This possibil- 
ity is closely connected to the use of quantum correlations 
for the calculation of the outcome of two measurements. In 
a setting where only the classical correlations are available, 
a simultaneous implementation of the two two-bit chains 
(Ti , T2, T3) and (Tg, T4, T12) would require more than three 
mutually commuting objects and can therefore not be im- 
plemented for a state with purity P < 3. 

We finally display the classical formulation for two par- 
ticular entangled pure states. They are given by 

P3 = = -epu = -1 , e = ±1. (156) 

The sign e = +1 corresponds to the rotation invariant spin 
singlet state with density matrix 

p = ^(1 - (n ® ti) - (t 2 ® t 2 ) - (r 3 <8> r 3 )), (157) 
and wave function 

V = -^2 - i> 3 ). (158) 

For e = — 1 the relative sign between -0 2 and "03 is positive. 

Probabilistic realism, locality and incompleteness 

It is often stated that Bell's inequalities imply that quan- 
tum mechanics has to abandon either realism or locality. 
We argue here that our implementation of quantum me- 
chanics is compatible both with "probabilistic realism" and 
locality. What is not realized, however, is a notion of "sta- 
tistical completeness" in the sense that joint probabilities 
for arbitrary pairs of observables are available and used for 
the measurement correlation. Statistical completeness is 
often implicitly assumed in the stochastic proofs of Bell's 
inequalities [111 ]. In contrast, our definition of the mea- 
surement correlation, which is based on conditional prob- 
abilities that can be determined by the state of the sys- 
tem alone without additional information from the envi- 
ronment, leads to "incomplete statistics" [13j. This is the 
basic reason why measurement correlations violate Bell's 
inequalities. 

"Probabilistic realism" starts from the premise that the 
most general fundamental description of reality is of sta- 
tistical nature [l3[ . "Elements of reality" , which allow for 
definite predictions, correspond then to values of observ- 
ables as well as to correlations. Let us consider the EPR 
case of two entangled spins, carried by spatially separated 
particles which originate from the decay of a spinless par- 
ticle and therefore have total spin zero. In this case the el- 
ement of reality is the maximal anticorrelation for all spin 
directions, rather than values of individual spins. This el- 
ement of reality is revealed by measurements of both spins 
and has existed already before the first measurement. In 



contrast, the value of one of the spins is maximally unde- 
termined before the first measurement and not an element 
of reality. 

Due to the correlation, the two spins have to be con- 
sidered as one system. Even for an arbitrarily large sepa- 
ration, such that signals cannot be exchanged any longer, 
we cannot divide the system into two independent subsys- 
tems, consisting of one of the spins each. The correlation 
between the two spins is then nonlocal. Nonlocal corre- 
lations are common in classical statistical systems, how- 
ever. As an example we may take ferromagnetism where 
the mean value of the spins is ordered in domains with 
macroscopic size. "Simultaneous measurements" of the 
mean value in spatially separated regions within the do- 
main will find the same mean value, even if no signals can 
be exchanged between the measurements in the two re- 
gions. The only condition for a causal local theory is in 
this case that the nonlocal correlation has been prepared 
in the past by local causal processes. This is precisely what 
happens for the EPR-spins. The maximal anticorrelation 
of the two spins has been "prepared" during the decay of 
the spinless particle, and persists later due to angular mo- 
mentum conservation. (For the antiferromagnet, one may 
invoke that the mean value of the spins and not only the 
correlation could now correspond to an element of phys- 
ical reality. However, we also could consider the system 
somewhat above the critical temperature, where the mean 
value vanishes but correlations persist for macroscopic dis- 
tances.) We conclude that quantum mechanics shares the 
same properties of probabilistic realism and locality as any 
other classical statistical system. 

What is different from many usual classical statistical 
systems as encountered, for example, in classical thermo- 
dynamics, is the property of "incomplete statistics" char- 
acterizing the quantum systems. We advocate that the 
general definition of a measurement correlation for a pair 
of observables cannot be based on joint probabilities for 
the two observables. We believe that this incompleteness 
holds, in principle, for all statistical systems. Complete 
statistics, where the measurement correlation is expressed 
in terms of joint probabilities, obtains only as a special 
limiting case of incomplete statistics. 

The basic reason for statistical incompleteness is the ob- 
servation that statistical completeness is not, in general, 
compatible with the notion of measurements in isolated 
statistical subsystems. Any isolated subsystem is charac- 
terized by system observables. They are a subset of all 
the observables of the larger "total system" , which can be 
regarded as the subsystem and its environment. Isolation 
means that the probability for finding a given value from 
the spectrum of a system observable should be determined 
by the state of the subsystem alone. It should not involve 
additional properties of the environment. Furthermore, it 
should be possible to determine the state of the subsystem 
by a certain number of expectation values of system observ- 
ables. If a measurement of a pair of two system observables 
respects the isolation of the subsystem, the outcome should 
again be determined by the state of the subsystem, without 
invoking further information from the environment. 
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These simple, rather compelling characterizations of the 
notion of an isolated statistical subsystem are not compat- 
ible, in general, with statistical completeness. The basic 
reason is that the mapping from the space of observables 
for the total system (including the environment) to the sys- 
tem observables is not invertible. Different observables of 
the total system are mapped to the same system observ- 
able. Their difference resides only in different properties 
of the environment, whereas from the point of view of the 
subsystem they are all equivalent. The system observables 
define equivalence classes. As we have shown in detail, the 
joint probabilities differ for different representatives of a 
given equivalence class. They are therefore not a prop- 
erty of the equivalence class alone. In other words, the 
joint probabilities are not properties of the system alone, 
but also involve detailed information about the environ- 
ment. They cannot be computed from the information 
which characterizes the state of the subsystem. For this 
reason the joint probabilities cannot be used for predict- 
ing the outcome of measurements in an isolated statistical 
subsystem. A generic measurement correlation, which de- 
termines the outcome of measurements of pairs of system 
observables in terms of the state of the subsystem alone, 
therefore leads to incomplete statistics. 



IX. SEQUENCE OF MEASUREMENTS IN A 
SUBSYSTEM 



We have constructed a consistent implementation of 
quantum mechanics within a classical statistical ensemble, 
which leads to the quantum laws for expectation values of 
observables, correlations between two measurement which 
may violate Bell's inequalities, and the unitary time evo- 
lution. The classical statistical description therefore re- 
produces all the surprising effects of quantum mechanics. 
Many of them are related to our consistent choice of a mea- 
surement correlation for isolated subsystems. In this sec- 
tion we discuss further properties of sequences of measure- 
ments and their close connection to the non-commutativity 
of quantum operators. We restrict the discussion of this 
section to two-level-observables. An extension to measure- 
ments of observables with a spectrum of more than two 
distinct values may need an appropriate generalization. 

Measurement chains 

Consider first the classical statistical ensemble which de- 
scribes the two-state quantum system (M = 2), and a 
chain of three measurements of two-level observables. For 
a sequence of measurements of first C, then B, and fi- 
nally A we compute the probability w+i + that all ob- 
servables are measured to have the value +1, or the prob- 
ability w^^ 30 ^ that A is found to have the value —1, B 
the value +1 and C the value —1, and similarly for other 
combinations. After a measurement of the second observ- 
able B = 1 the system is projected to a density matrix 
Pb+ = 5(1 + B), independently of the first measurement 
of C, such that the conditional probabilities (| 1 1 3[) depend 
only on (AB) m =tr(AS)/2. In consequence, one finds for 



7 ,<5,e = ±l 



'ii SC) = - A {l+l5{AB) rn ){l + 5e{BC) m )w^ s . (159) 

In particular, we may consider the basis observables A = 
C = A^\B = A^ and compute 



(ABA) 1 A 
W_ s ^ ' = -W 



(160) 



This clearly demonstrates that a series of measurements 
which are compatible with the preservation of the isola- 
tion of the subsystem cannot be reduced to a consecutive 
elimination of states of the classical ensemble. For a clas- 
sical elimination process all states r for which A T = — 1 
are eliminated if the first measurement yields A = +1. 
The "classical probability" of finding A = — 1 in later mea- 
surements must therefore be zero, in contrast to the result 
(|160p . Our prescription for conditional probabilities re- 
produces the quantum mechanical feature that the second 
measurement of A^ leaves after the measurement a state 
with equal probabilities to find A^ = ±1, independently 
of the preceeding history. This underlines the particular 
role of measurements of not only eliminating the states 
which contradict the measured value of the measured ob- 
servable, but also reshuffling the probabilities for other ob- 
servables that are not measured. The particular form of the 
modification of probabilities for A^ as a consequence of a 
measurement of A^ 2 \ which results in eq. (|160p . is due to 
the requirement that after a "good measurement" the en- 
semble should still obey the purity constraint. One should 
find a new state of the subsystem for which future mea- 
surements should not depend on the environment. As well 
known, this can be experimentally verified by a sequence of 
three Stern-Gerlach measurements. "Good measurements" 
of properties of the subsystem act similar to polarization 
filters for electromagnetic waves and are closely related to 
the particle- wave duality in quantum mechanics. 

Sequence of measurements for M-state quantum 
systems 

We can use the measurement probabilities (|159[) in order 
to define the three point correlation for a sequence of two- 
level observables with spectrum {+1,-1} [i[ 



{ABC), 



7,S,e 



U ~/Se 



and find 



(ABC) m = ~tr({{A,S},C}p). 



(161) 



(162) 



For M = 2 one may use in eq. (|162[) the identity {A, B} = 
2{AB) m such that (ABC) m = (AB) m {C). We will next 
show that eq. (|162[) holds for minimally destructive mea- 
surements for general M. For this purpose we will have 
to discuss the properties of the projection (|118[) in more 
detail. 

We first note that pa+ becomes formally ill defined for 
states where (A) = — 1, due to the normalization factor 
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~ (1 + (A)) 1 which appears for M > 2 in the terms mul- 
tiplying the d fcim -symbols 



1 . ix ( , , d klm akPiL r , 
PA+ = -<i + A )\l+ 1 + {A) 

(1 - Mp) 
2M(1 + (A)) 



(163) 



This divergence cancels for physical quantities - for exam- 
ple by multiplication with w+ s in eq. (|120|) . For the con- 
ditional probability one obtains 



itr((l 



B)pa+) (164) 

1 ti({B + AB + BA + ABA)p) 
2 + 4(1 + (A)) 



and (tu_)4 = 1 — (w?)'?. The boundedness and positivity 
of these expressions for (A) — > — 1 may not seem obvious. 
It is more convenient to work here with an unnormalized 
density matrix p~A+ after the first measurement, and per- 
form the proper normalization at the end. We define p~A+ 
in terms of the projector Pa+ 



l + A 

PA+ = Pa+pPa+ , Pa+ = —z— , P 



P. 



A+, 



trp^H 



(1 + (A))/2. 



(165) 



Associating the change after a measurement to a projec- 
tion onto an unnormalized state has the advantage that 
a subsequent measurement can be described by the same 
procedure. In particular, a second measurement of A does 
not change the state further. The map p — > pA+ is now 
linear. With (u>+)+ + (io?)+ = 1 we can express 



1 + i? 



<)+--(«£)+- 



«)+' + («£)+■' 



The normalization drops out of the ratio R, 

tv{Bp A+ ) 



R = 



trp^n 



(166) 



(167) 



and we obtain < (u>+)+ < 1 provided — 1 < R < 1. Since 
B has eigenvalues ±1 we can infer 



trpA+ < tr(Bp A+ ) < trpA^ 



(168) 



using the positivity of all diagonal elements, (p A +) aa > 0, 
in a basis where B is diagonal. (We can infer [pA+)aa > 
in a basis where A is diagonal and Pa+ is a diagonal ma- 
trix with entries 1 and 0. A change of basis by unitary 
transformations preserves this property.) For any limit- 
ing sequence of states p for which (A) approaches —1 and 
therefore tYp~A+ approaches zero from above, we can indeed 
conclude \R\ < 1. Therefore (w+)+ is well defined, positive 
and smaller or equal one for (A) arbitrary close to —1. For 
states with (A) = — 1 the limit may not be unique, but this 
is irrelevant since for further measurements in such states 
only the conditional probabilities (w±)^ arc needed. 



For a product of two conditional probabilities 



K)?K)5 



tr 



l+Al+B 



l + B 



-Pc^ 



2 2 

,B\C 



l+B 



-«) 



(169) 



PC + 



1 



tr(l + A)(l + B)p c+ (l+B) 



the normalization factor for {wr)?l involves the expecta- 
tion value of B evaluated for a density matrix pc+- This 
follows since A has to be evaluated for a density matrix 
Pb+, arid ps+ obtains from eq. () 1 1 8[) by replacing A — > B 
and p — > pc+- The normalization factor is exactly can- 
celed by («;+)+. We can use eq. (|169j) in order to derive 
the identity 

+ ^ A ) B ^ B ) C ± = «f± , D = \{A, B}, 

(170) 

where the observable D is represented by the anticommu- 
tator of A and B. Similarly, one has 



(171) 



We may therefore define a "combined observable" D = 
AoB for the measurement of A after B, where the results 
of the measurements of A and B are multiplied. In other 
words, D is again a two level observable, which takes the 
value +1 if A and B have the same value, whereas D = — 1 
if A and B have opposite values. The conditional proba- 
bilities (u>+)+ etc. can be obtained if the observable D is 
represented by the operator D = {A, B}/2. One concludes 
that the order of the measurements of A and B does not 
matter. Eqs. (|170[) . (|17ip can directly be used for the mea- 
surement correlation p61[) for sequences of measurements, 



(ABC) m = {DC) m = ~tT{{D.C} n 



-tr[{{A,B},C}p 



(172) 



and we recover eq. (|161|) . The generalization to more than 
three measurements is obvious - one starts from the left to 
group pairs into combined observables. We emphasize that 
the order matters. For (ABC) m we cannot group BC in a 
combined observable, since the latter would involve a sum 
over +1 and —1 values of B, combined with the appropriate 
+1 and —1 values of C, while the combinations appearing 
in (ABC) m cannot be factorized in this way. 

Quantum commutator 

The commutator between two quantum operators can 
be related to the issue of ordering of a sequence of three 
measurements. Indeed, we find for the difference between 
two measurement correlations for three measurements eval- 
uated in a different order 



(ABC) m 


- (ACB) m 


1 

4 


(ABC) m 


- (CBA) m 


1 

4 


(ABC) m 


- (BAC) m 


= 0. 



\tr([B,[A,6\]p 



(173) 
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The non-commutativity of (ABC) m is deeply rooted in the 
use of conditional probabilities, since AB can be combined 
to a "composite observable", but not BC. The classical 
correlation, where {ABC) = (ACB), is a special case for 
which the commutator vanishes for all pairs of observables. 
We observe the non-commuting structure of the three point 
correlation even if relations like 



(«?)M)? + («?)V)? 



(174) 



hold, as in our case, such that (AB) m = (BA) m . The 
measurement correlation that we propose for our classical 
statistical setting reproduces a sequence of quantum mea- 
surements. For example, if we consider M = 2 and two 
orthogonal basis observables A^ and A^ 2 \ we obtain 

(A^A^A^) m = , (A^A^A^) m = (A^). (175) 

The commutator can be rooted directly in the condi- 
tional probabilities 



(«4)+<- = \(l±(A) + (B)±(AB) m ) 



±±tr([[B,A],B]p 



(176) 



For example, the probability of finding A = +1 after a 
measurement of B 1 

1 



K)^< s + K)V, s = -(i + (A)) 



tr( [[B,A],B]p 



(177) 



differs by the last commutator term from the probability 
of finding A = 1 without a measurement of B, is given by 
w? s = (1 + (A))/2. This directly reflects the modifications 
of the probabilities for finding A = ±1 as a consequence of 
a measurement of B. As we have discussed above, this is a 
necessary property if the measurement of B is compatible 
with the isolation of the subsystem, i.e. if predictions for 
the subsystem can be done without invoking knowledge of 
the environment. 

The commutator also characterizes the difference be- 
tween the probabilities of first measuring B — 1 and then 
A = 1 or first A = 1 and then B = 1, 



(«?)V 



i 

16' 



tr([[B,A],(B + A)]p 



(178) 



Similarly, one finds 

K)f <, - ( W S )M. S = -^tr([[S, i], (3 - . 



(179) 



The r.h.s of eqs. (|178j) . (|179p changes sign if we switch the 
sign of all values of A and B, leading to a commutative 
measurement correlation (AB) m = (BA) m . For commut- 
ing observables the order of the measurements does not 



matter for any pair of possible outcomes. This applies, 
in particular, for two observables with support in regions 
with spacelike separation, which have to commute in order 
to avoid contradictions with causality. 

Sequence of four measurements and uncertainty re- 
lation 

Consider sequences of four measurements, where two dis- 
tinct two-level-observables A and B (A 2 = B 2 = 1) are 
measured in a different order. With 



1 



{ABCD) rn = -tr(p{{{A,B},C},D}) 



(180) 



we may compute the difference of measurement correlations 

(ABAB) m + (BABA) m - {A 2 B 2 ) m - (B 2 A 2 ) m = 
hr{p([A,B}) 2 }, (181) 

where A and B are the quantum operators associated to A 
and B. This relates the expectation value of the squared 
commutator directly to the difference of measurement se- 
quences in different orders. Also recall Heisenberg's uncer- 
tainty relation 



AA 2 AB 2 > ~ 



tr{p([i,B])}| , (182) 



which holds for pure states of our classical statistical en- 
semble. For [A, B] proportional to the unit matrix this 
relates AA 2 AB 2 directly to the sequence of measurements 
CH]). 

Products of quantum observables 

The isomorphism between equivalence classes of quan- 
tum observables and the quantum operators allow for the 
introduction of two different products of quantum observ- 
ables. A product of two quantum observables should again 
be a quantum observable and is therefore represented by a 
hermitean operator. We have already discussed before the 
symmetric product (AB) S — (BA) S which is represented 
by the anticommutator 



{AB) S = {BA) S 



(183) 



The measurement correlation is described by the expecta- 
tion value of this symmetric product, 



(AB) rn = ((AB) S ). 



(184) 



Further, we can associate an antisymmetric product (AB) a 
to the commutator multiplied by —i/2, 



(AB) a = -(BA) a <->--[A,B]. 



(185) 



The antisymmetric product characterizes the difference be- 
tween three point correlations in different order 



(ABC) m - (ACB) m = -((A(BC) a )J. 



(186) 
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In this context we emphasize that the products (AB) S 
and (AB) a are the quantum observables which are asso- 
ciated to the operators ^{A,B} and — ^[A,B]. For ex- 
ample, if {A, B} — 0, as for M = 2 and A, B represent- 
ing two orthogonal spins, the product (AB) S is the zero- 
observable with all eigenvalues zero. This differs from the 
combined observable D = Ao B discussed before. The lat- 
ter is a two level observable, with D 2 = 1, in contrast to 
((AB) S ) = 0. The observable AoB is mapped to the oper- 
ator D = \{A : B} - the same as for (AB) S . It is, however, 
a system observable which is not a quantum observable. 
For example, D 2 = 1 is not represented by D 2 . We re- 
call that different system observables can be mapped to 
the same operator. The inverse mapping from operators to 
probabilistic observables is only defined uniquely if the ob- 
servable is specified to be a quantum observable. Examples 
for complete observable systems are classical realization off 
all possible observables for M-state quantum mechanics or 
of a maximal subset of commuting observables. 

If a classical ensemble can describe a subsystem which 
admits a complete set of classical quantum observables, all 
measurement correlations can be expressed as expectation 
values of appropriate observables. We note that the mea- 
surement correlations for the subsystem can be computed 
from the state of the subsystem even if the system of ob- 
servables is incomplete, for example of (AB) S has no real- 
ization as a classical observable. It is interesting to observe, 
however, that for complete observable systems the mea- 
surement correlations have even formally the same states 
concerning "reality" as all other quantum observables. 

Complex structure 

All our discussion so far concerns only real physical quan- 
tities: the spectrum of observables comprises only real val- 
ues, the probabilities for finding a value from this spectrum, 
the conditional probabilities for sequences, as well as ex- 
pectation values and correlation functions are real. The 
state can be characterized by real numbers pk, and the 
time evolution dtpk = Fk involves real quantities Fk- Also 
the prodcuts (AB) S and (AB) a are "real observables" with 
a real spectrum. Nevertheless, we have found a convenient 
expression of all quantities in terms of complex M x M 
matrices. 

The apparent complex structure in quantum mechanics 
can be related to the existence of two different product 
structures (AB) S and (AB) a for quantum observables. In- 
deed, we may define a complex product as 

(AB) C — (AB) S + i(AB) a . (187) 

On the level of operators it is represented by the complex 
operator prodcut 

{AB) C <-> AB, (188) 

consistent with \{A,B) + i (-%[!, B] \ = AB. This al- 
lows us to associate to AB the two products of quantum 
observables (AB) S and (AB) a . The isomorphism between 
the algebra of quantum operators and probabilistic quan- 
tum observables can be extended to operators which are 
no longer necessarily hermitean anymore. 



Complete observable systems 

We call a system of quantum observables "complete" 
if for arbitrary pairs (A, B) of quantum observables their 
products (AB) S and (AB) a are also quantum observables 
of the system. Since (AB) S and (AB) a can always be de- 
fined as probabilistic quantum observables with the ap- 
propriate spectrum (7 Q ) and associated probabilities w a , 
an incomplete system of observables may be completed by 
adding the quantum observables associated to the products 
(AB) s , a . This is always possible on the level of probabilis- 
tic observables, but not necessarily on the classical level. 
Examples for complete observable systems are classical re- 
alizations of all possible observables for M-state quantum 
mechanics or of a maximal subset of commuting observ- 
ables. 

If a classical ensemble can describe a subsystem which 
admits a complete set of classical quantum observables, all 
measurement correlations can be expressed as expectation 
values of appropriate observables. We note that the mea- 
surement correlations for the subsystem can be computed 
from the state of the subsystem even if the subsystem of 
observables in incomplete, for example if (AB) S has no real- 
ization as a classical observable. It is interesting to observe, 
however, that for complete observable systems the mea- 
surement correlations have even formally the same status 
concerning "reality" as well as other quantum observables. 



X. CONCLUSIONS 

We have obtained all laws of quantum mechanics from 
classical statistics, including the concept of probability am- 
plitudes ip and the associated superposition of states with 
interference and entanglement, as well as the unitary time 
evolution. Our classical statistical description is genuinely 
probabilistic and not a local deterministic model. It al- 
lows to predict probabilities for the outcome of a chain of 
measurements, but not a deterministic result of a given 
measurement in terms of some "hidden variables" . Bell's 
inequalities can indeed be violated for our formulation of a 
correlation function which is based on conditional proba- 
bilities for a sequence of measurements. Since the mapping 
from classical observables to quantum operators is not in- 
vertible, no contradiction to the Kochen-Specker theorem 
arises. 

Our setting can be extended to include observables like 
location and momentum by considering many two-level ob- 
servables on a space-lattice and taking the limit of van- 
ishing lattice spacing [T^ |. Both quantum particles and 
classical particles can be described by appropriate classi- 
cal statistical ensembles. Even a continuous interpolation 
becomes possible. As a function of some continuous param- 
eter 7 in the interval [0, J], the classical ensembles describe 
for 7 = a quantum particle - "passing simultaneously" 
through the two slits in a double slit experiment and pro- 
ducing an interference pattern - or for 7 = tt/2 a classical 
particle that passes only through one of the slits. A contin- 
uous interpolation becomes possible for intermediate values 
of 7 0. 
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In our statistical mechanics setting quantum mechan- 
ics describes isolated subsystems of a larger ensemble that 
also includes the environment. Isolation does not mean 
that the subsystem can be described by classical probabili- 
ties for the states of the subsystem and sharp values of the 
observables in these states. It rather relates to a separated 
time evolution of the subsystem and to observables which 
can be described by quantities only associated to the sub- 
system, without explicit reference to the environment. The 
question why Nature shows a strong preference for subsys- 
tems that are described by quantum mechanics can now be 
addressed within the general framework of classical statis- 
tics which allows, in principle, also subsystems without the 
characteristic features of quantum physics. We conjecture 
that the answer is related to particular stability proper- 
ties for the time evolution of subsystems with a quantum 
character Q. 

The deep question if small deviations from quantum me- 
chanics are possible, and of what nature they could be, 
finds in a classical statistical setting an appropriate frame- 
work to be addressed. The overall probabilistic description 
of the whole world (or the entire reality) involves an in- 
finity of states or degrees of freedom both in the quantum 
and the classical statistical description. It is well conceiv- 
able that they can be mapped onto each other such that 
in this very general sense no deviations from quantum me- 
chanics occur. The perhaps more interesting issue concerns 
the possible descriptions of isolated subsystems, as isolated 
atoms or isolated M-state quantum systems. For such sys- 
tems it seems possible that quantum mechanics is only a 
very good approximation, but small deviations could oc- 
cur. The following discussion shows that such deviations 
from simple M-state quantum systems or from the ideal 
quantum particle do actually occur in practice. They are 
often related to phenomena as decoherence or imperfect 
measurements, for which the quantum character of the de- 
scription can only be restored if the environment is included 
in the description. 



Consider a subsystem that is well approximated by M- 
state quantum mechanics, in the sense that only the ex- 
pectation values pk are available for a description of the 
state of the system and that the quantum mechanical laws 
hold to a good approximation. There are several types of 
possible small deviations for an exact quantum behavior, 
(i) The time evolution may not be unitary. In particular, 
the purity may decrease (decoherence) or increase (synco- 
herence). (ii) The time evolution may not be linear. This 
happens if the Hamiltonian operator H , or more generally 
the matrix T^i and D in eq. (|133|) . depend themselves on 
Pk- (hi) If the pk violate during the time evolution the 
purity constraint ([58]) - ([62]) . we expect phenomena as the 
violation of the quantum mechanical uncertainty relation, 
(iv) Deviations from quantum mechanics may occur if only 
a limited part of the infinitely many classical states neces- 
sary for a complete quantum description is accessible for a 
subsystem (v) Our framework yields also a formalism 
for the description of imperfect measurements. In this case 
the measurement correlation (I122[) , which is based on min- 
imally destructive measurements, has to be replaced by a 
different correlation, adapted to the "imperfection" of the 
measurement. 

We do not intend to enter here the debate if quantum 
mechanics or classical statistics are more fundamental -it 
is well known that classical statistics can be obtained as 
a limiting case of quantum mechanics. In our view classi- 
cal statistics and quantum mechanics are two sides of the 
same medal. This may have far reaching consequences, as 
the possibility that the late time asymptotic state of a clas- 
sical ensemble may be given by the equilibrium ensemble 
of quantum statistics, or that the classical statistical real- 
ization of certain steps in quantum computations can find 
a practical implementation. We find it remarkable that the 
conceptual foundations of quantum mechanics need not to 
go beyond the concepts of classical statistics. 
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